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1 Introduction 


The Harris Real Time Express (RTX) 2000 Family of microcontrollers is a 
highly integrated family of 16-bit CMOS microcontrollers designed for real-time 
control systems requiring high performance with low power consumption. 


1.1 The RTX 2000, 2001A, And 2010 Microcontrollers 


The architecture of the RTX 2000 Series of products results in high instruction 
execution rates. The highly parallel architecture allows the RTX to perform 
several functions in one instruction cycle, and all instructions execute in either 
one or two clock cycles. Instructions are fetched from memory and executed 
‘immediately; there are no instruction "pipelines" or caches to flush when 
performing branches or calls. | ; 


The RTX 2000, 2001A and 2010 Microcontrollers have on-chip support 
hardware for performing many of the functions typically needed in a real-time 
system, including an interrupt controller, a memory page controller, two stack 
controllers, and three 16-bit counter/timers. In addition to these “on-chip 
peripherals", the RTX 2000 provides a 16-by-16 hardware multiplier, while the 
RTX 2010 provides a 16-by-16 hardware multiplier-accumulator along with a 32- 
bit Barrel Shifter and a 32-bit Leading Zero Detector for Floating Point support. 
Table 1.1 shows a break-out of the features of each of these products. 


The RTX 2000 Class architecture was designed to execute the high-level 
language Forth as its "assembly language". The instruction set provides the 
features necessary for implementing much of the Forth language directly. 
Instructions are available for manipulating stacks, performing memory access, 
controlling program flow, and basic math and logic operations. 


One RTX instruction may combine the functions of two or three high level Forth 


instructions, resulting in an effective processor throughput which is faster than 
the processor clock speed. . 
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RTX 2000 
interrupt 
Controller 


Stack 
Controller 


Two 256-Word 
Stacks 


Three 16-Bit 
Timer/Counters 


1-Cycle 16-Bit 
Multiplier 


RTX 20014 


interrupt 
Controller 


Stack 
Controller 


Two 64-Word 
Stacks 


Three 16-Bit 


Timer/Counters | 


TABLE 1.1: RTX On-Chip Hardware Peripherals 


The stack oriented architecture of the RTX also makes it well suited for running 
such computer languages as C. 


RTX 2010 
Interrupt 
Controller 


Stack _ 
Controller 


Two 256-Word 
Stacks 


Three 16-Bit 
Timer/Counters | 


1-Cycle 16-Bit 


Mult./Accum. 


1-Cycle 32-Bit 
Barrel Shifter; 
Floating Point 
Support 
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1.2 The RTX 2000 Family Programmer’s Reference Manual 


Figure 1.1 offers an overview of the interface between a user and an RTX 
Microcontroller. The documentation which supports each layer of this interface 


is also shown. 


User © 
Interface 


Development 
System 
Software 


_ ATX Forth 
Primitives 


RTX 
_ Hardware 


Support Documentation 


Quick 
Reference 
Card 


Software Reference 
Manuals 


Hardware Reference 
, Manual 


FIGURE 1.1; USER/RTX INTERFACE 
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The Programmer’s Reference Manual describes the RTX 2000, RTX 2001A, and 
RTX 2010 Microcontrollers from a programmer’s point of view, including 
architecture, registers, data paths, hardware interfaces, and primitive instructions. 
Topics described in various sections of this manual include: | 


_ Chapter 
Chapter 
Chapter 
Chapter 
Chapter 
Chapter 
Chapter 
Chapter 


Z 
3 
4 
5 
6 
7 
8 
9 


Chapter 1 


Overall architecture of RTX microcontrollers 
General operation of RTX microcontrollers 
The RTX register set 

Memory Interface . 

On-chip Peripheral Devices 


‘RTX Instruction Set 


Implementation of Multi-step Math Functions 
Implementing Forth on the RTX 
Code Optimization Techniques 


Some functional differences exist between the different members of this family 
of microcontrollers. When such differences exist, the applicable sections of this 
manual describe those differences. Where major differences exist, they are 
broken into separate paragraphs, and are offset with a side bar for clarification. 


For additional information specific to your microcontroller, please refer to the 
appropriate data sheet. 
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2  RTX Architecture 


This chapter provides an overview of the programmer’s model of the RTX 
Microcontroller architecture. Figures 2.1, 2.2 and 2.3 show block diagrams for 
the RTX 2000, RTX 2001A, and RTX 2010 Microcontrollers respectively. 


The RTX microcontroller is a stack based machine with two on-chip stacks. 
Most math, I/O and memory reference operations take their operands from the 
Parameter Stack, and leave their results on the Parameter Stack. Subroutine calls 
use the Return Stack for saving their return addresses. 


There are twenty-three registers on the RTX 2000, twenty-four registers on the 
RTX 2001A, and twenty-five registers on the RTX 2010. These registers control 
processor configuration and status, hold intermediate results during computations, 
and provide an interface between the processor and its on-chip peripheral 
devices. 


The RTX registers and stacks are interconnected through a series of 16-bit data 
buses which transfer data within the processor and with the outside world. 


CONTROL 


FIGURE 2.1: RTX 2000 BLOCK DIAGRAM 
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FIGURE 2.3: RTX 2010 BLOCK DIAGRAM 
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2.1 On-Chip Stacks 


The RTX microcontroller contains two on-chip last-in-first-out (LIFO) stack 

memories. The top elements of each stack are immediately accessible through 

registers. The remainder of each stack is located in on-chip RAM arrays. The 

control logic associated with each stack determines which stack locations are to 

be read or written, and monitors the stacks for overflow and underflow 
conditions. See Section 3.1 for a description of stack operations. 
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Stacks on the RTX 2000 and RTX 2010 are each 256 
elements deep; stacks on the RTX 2001A are 64 
elements deep. | 
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2.1.1 The Parameter Stack aa 
e e 


The 16-bit wide Parameter Stack provides the operands for most math, logic, and 
memory reference instructions. It is used for passing parameters between 
subroutines, and as a scratchpad area for temporary storage of data. 


The top two elements of the Parameter Stack are the TOP Register, which 
contains the top element, and the NEXT Register, which contains the second 
element. For certain instructions, TOP or NEXT are the implicit data source or 
destination, and the RTX can perform operations dealing with TOP and NEXT in 
one clock cycle. For more information about TOP and NEXT, see Chapter 4. 


2.1.2 The Return Stack 


The 21-bit wide Return Stack is used for storing subroutine return addresses and 
for holding index counts for loops and repeated instructions, and can also be used 
as a temporary storage area. The top element of the Return Stack is comprised 
of the 16-bit wide I register and the 5-bit wide IPR Register. The RTX can move 
- data between the top elements of the Parameter and Return Stacks in a single 
clock cycle. For more information about I and IPR, see Chapter 4. | 
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2.2 RTX 2000 Series Bus Architecture 


The RTX 2000 Series bus architecture provides for unidirectional data paths and 

simultaneous operation of some data buses. This parallelism allows for 

maximum efficiency of data flow. External data is transferred via the ASIC Data 

Bus and the Memory Data Bus. Addresses for external access are output via the 
eee) Memory Address Bus and the ASIC Address Bus. 


Zeid Data Buses 


The RTX QUAD Bus™ architecture consists of 4 independent 16-bit data buses, 
all of which may be active simultaneously. 


e The Memory Data Bus carries program instructions and program data 
to and from Main Memory. 16-bit data words (but not program 
instructions) are passed through byte-swapping hardware which allows 
the processor to control the order of storage in memory for the low 
and high bytes of the word. 


e The ASIC Bus™ is the I/O and register interface bus. This bus 
provides the interface between the Parameter Stack and the processor 
registers and external I/O devices. The ASIC Bus passes input data 
through the on-chip Arithmetic/Logic Unit (ALU) before pushing the 
data onto the Parameter Stack. This allows the RTX to perform math 
(adding, subtracting), logic (masking), and shifting operations on the 
data as it is being read. 


e The Parameter Stack Bus carries data between the top-of-stack © 
registers and the Parameter Stack: RAM. 


° The Return Stack Bus carries data between the od Gies registers 
and the Return Stack RAM. 
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2.2.2 Address Buses 


For off-chip communications, the RTX microprocessor has two address buses: 
the 19-bit Memory Address Bus, and the 3-bit ASIC Address Bus. 


e The Memory Address Bus (MA19-MA0O1) carries the address of the 

_ Main memory location to be accessed, either for instruction fetches or 

memory read/write operations. This is a 19-bit bus, along with Upper 

Data Strobe (UDS) and Lower Data Strobe (LDS), which allows the 
RTX to address 1 megabyte of memory. 


e The ASIC Address Bus (GA02-GA00) carries address information for 
external ASIC devices. 


See Chapter 5 for information about RTX External Bus Interfaces. 
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2.3 Registers 

The RTX 2000 Series microcontrollers contain three types of registers. Stack 
related registers, Status/Control registers, and Internal Processor registers. 
2.3.1. Stack Related Registers. 

Stack related registers contain the top elements of the Parameter and Return 
Stacks. These registers are the implicit source and destination for many of the 
processor operations, and are described in detail in Chapter 4. 

2.3.2 Status/Control Registers 

Status/Control registers are accessed through the ASIC Bus, and determine the 
Operating environment for the processor by controlling the processor 
configuration and on-chip penpnets devices. These registers are described in 
detail in Chapter 4. 

2.3.3 Internal Processor Registers 


Internal Processor registers are not directly accessible to the programmer, and 
are described in Chapter 4. 
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2.4 “Memory 


_ The RTX 2000, RTX 20014, and RTX 2010 Microcontrollers directly address 

1 Megabyte (512K 16-bit words) of memory.. This memory is divided into 16 

pages of 64K bytes (32K words) each, and may be made up of any combination 
of ROM, RAM, or memory mapped V/O devices. 


nee RTX memory interface is described i in detail j in n Chapter 2; 
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3 Operations | 


This chapter provides an overview of the internal processor operations. The 
operations are described in greater detail in Chapter 7, "Instruction Set". 


3.1 Instruction Execution 


The RTX Microcontrollers have an Instruction Decoder which provides control 
of all data paths and the Program Counter Register (PC). This hardware 
determines what function is to be performed by looking at the contents of the 
Instruction Register (IR), and subsequently determines the Sequence of operations 
through data path control. 


In one-cycle operations, the instruction which is to be executed is latched into IR 
at the beginning of a clock cycle, then is decoded. All necessary internal 
operations are performed simultaneously with fetching the next instruction. See 
Figure 3.1. 


Instructions which perform memory access require two clock cycles to be 
executed. During the first cycle of a memory access instruction, the instruction 
is decoded, the address of the memory location to be accessed is placed on the 
Memory Address Bus (MA19-MA0O1), and the memory data (MD15-MD00) is 
read or written. During the second cycle, the address of the next instruction to 
be executed is placed on the Memory Address Bus, and the next instruction is 
fetched, as indicated in Figure 3.1. : 
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FIGURE 3.1: INSTRUCTION EXECUTION 
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3.2 Stack Operation 
The RTX Microprocessors utilize a Last-in, First-out (LIFO) stack architecture. 


In this type of architecture, the last data element stored in the memory stack will 
be the first element retrieved from that region of memory. See Figure 3.2. 


rq} 030)) Ce ee | ce oe 
CO) ) oo} ) ee) 
oo as | 


FIGURE 3.2: STACK OPERATION 


This structure for information storage and retrieval provides the computer with 
one central location for temporary storage of information. 


The RTX takes advantage of this architecture, utilizing two separate on-chip 
stacks. The first, the Parameter Stack, is used for temporary storage of data and 
for passing parameters between subroutines. The second, the Return Stack, 1s 
used to store return addresses during subroutine calls and returns. The Quad | 
Bus™ architecure of the RTX Microcontrollers allows both stacks to be accessed 
‘in parallel by a single instruction, this dual stack arrangement allows overhead 
to be minimized during subroutine operations. The Return Stack can also be 
used for temporary storage of values when it is not being used during a 
subroutine call or return. 


For faster access, both the Parameter Stack and the Return Stack utilize registers 
for the top elements and on-chip memory (Stack Memory) for the remaining 
elements. 


For more detailed information about RTX stack operation, see Section 6.1. 
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3.3 Subroutine Calls and Returns | 


An RTX subroutine call instruction has the address of the routine to be called 
embedded in the instruction. When the subroutine call is executed, the address 
of the instruction following the call instruction is pushed onto the Return Stack. 

When the subroutine is completed, a Return-from-Subroutine instruction will pop 
the return address from the stack, and execution will resume with the instruction 
following the call. 


The RTX architecture is optimized for performing subroutine calls and returns 
with minimum processor overhead. A subroutine call within the same memory 
page can be made in one clock cycle. A call to a location in a different memory 
page takes 3 clock cycles. 


Subroutine returns take 0 clock cycles if performed as part of another instruction, 
and 1 cycle if executed as a separate instruction. 
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3.4 Branching And Looping 


The RTX can perform unconditional branches or conditional branches, based on 
the contents of the top elements of the Parameter and Return Stacks. All 
branches take one clock cycle, regardless of whether or. not the branch is 
performed. 
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3.5 _ Streamed Instructions _ 


_ The RTX processor has a “streamed ‘ instruction feature, in which an instruction 
is repeated a specified number of times. without repeating the instruction fetch 
cycle. This feature is useful a doing fast data transfers, loops and some math 
functions. ee i nes 


ee a ‘See Chapter 1, “Instruction Set" for n more details about the: "streamed" instruction is 
feature. | 


yr a RTX 2000 Family Programmer's Reference Manual 


3.6 Math/Logic Operations 


Math and logic operations are performed by the ALU circuitry of the RTX. The 


operations which may be performed include the simple math operators + and = . 
and the logic operators AND, OR, XOR, NOR, NAND, XNOR, and NOT. 


See Section 6.3 for information about the on-chip hardware 
multiplier, multiplier/accumulator, barrel shifter, and Floating 
Point support features. | | a 


The TOP register is always one input to the ALU. The second, “Y", input may 
come from a variety of sources, as indicated in Figure 3.3. 
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FIGURE 3.3: ALU DATA FLOW 
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3.6.1 Registers And I/O Devices 

The contents of the TOP and NEXT registers are always available as operands to 
the ALU, and are the implicit operands for most of the RTX Math/Logic 
instructions. 


The contents of the other registers and external I/O devices are addressable as 
devices on the ASIC Bus. 


3.6.2 Memory — 
Data may be fetched from, and stored to, Main Memory using the Word and 


Byte access instructions (Classes 14 and 15 in Chapter 7, the “Instruction Set") 
and User memory access instructions (Class 12). 


3.6.3 Literals 
A literal is a constant value to be pushed onto the stack, or to be used as the 
second operand of an arithmetic or logic operation. The RTX processor 


recognizes two types of literals - short literals and long literals. 


A short literal j is a 5-bit value between 0 and 31 and is encoded as a field ina 
machine instruction. 


A long literal may be any signed or unsigned 16-bit integer, and is stored in main 
memory immediately following the opcode that utilizes it. 
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3.7 Stack Operations 


The top two locations of the Parameter Stack are TOP and NEXT, and the > 
remainder of the stack memory is located in on-chip RAM. Because of this, the 
RTX Microcontrollers have the ability to manipulate stack elements to allow 
optimization of many instructions. - Descriptions of these stack manipulation 
operations are given in the following sections. These primitives can be combined 
with other operations to allow one-cycle execution of multiple operations. See 
Chapter 7 for information about specific instructions. 


3.7.1 DUP 


pup copies the top element of the Parameter Stack, and pushes the result onto the. 
stack, leaving the stack with two identical elements in the top two stack locations. 


Parameter Parameter 
Stack Stack 


Before After 


FIGURE 3.4: STACK EFFECTS OF DUP 
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3.7.2 SWAP 


- swap flips the top two siements of the Parameter Stack, causing the top element 
to move to the second location, ‘and the second. oa to move to a top 7 


location. 
Parameter. | 7 Parameter 
Stack ; 7 7 Stack | 
Before’ After 
y, LEB es | | 
FIGURE 3.5: STACK CTS OFSWAP 
3.7.3 DROP 


DROP pops the Parameter Stack, firepoing the top element. That element i is lost, | 
and 1 is not ee in subsequent Operations. _ ) 


Parameter 4 Parameter 
‘Stack |, Stack 


Before nee After | 


"FIGURE 3.6: STACK EFFECTS OF DROP 
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3.7.4 OVER 


OVER copies and pushes the third Parameter Stack element into the top location. 


Parameter Parameter 
Stack Stack 


Before After 


FIGURE 3.7: STACK EFFECTS OF OVER 


3.7.5 >R 


>R (called "to R") takes the information in TOP and stores it in the least 
significant 16 bits (1) of the top location of the Return Stack. This causes the 
current Code page value to be written to IPR, the most significant 5 bits of the 
top location of the Return Stack. 
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FIGURE 3.8: STACK EFFECTS OF >R 
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3.7.6 R> 


r> (called "R from") retrieves the information in the least significant 16 bits of 
the top element of the Return Stack and pushes it into TOP. 
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FIGURE 3.9: STACK EFFECTS OF R> 


3.7.7 R@ 


ra (called “R fetch") copies the top of the Return Stack to the top of the 
Parameter Stack. 


| Before ; | ; | After 
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FIGURE 3.10: STACK EFFECTS OF R@ 
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3.8 Interrupts 


The RTX processor may be interrupted from several sources, both from internal | 
devices and from external inputs. | 


The on-chip Interrupt Controller has fourteen interrupt request inputs. Thirteen 
of these interrupt request inputs are maskable interrupts, and one is a Non- 
Maskable Interrupt (NMI) request. 


3.8.1 Maskable Interrupts 


The Interrupt Controller samples the request inputs during each instruction, 
prioritizes any active interrupt requests, and signals the processor when an 
interrupt request is present. 


For more information about interrupt acknowledgement, disabling interrupts, and 
software interrupts, see Section 6.2. 


3.8.2 Non-Maskable Interrupts (NMI 


The NMI is an external, edge-sensitive input which requires a rising edge to 
request an interrupt. 


3.8.2.1 On the RTX 2000 


The NMI can cause the processor to perform an Interrupt Acknowledge 
cycle in the middle of such operations as Step Math instructions, Streamed 
instructions, and other operations that could result in the loss of data or 
misoperation of the hardware if interrupted. For this reason, a "Return 
From Subroutine" should not be performed from the NMI service routine. 
Instead, the NMI handler should re-initialize the system. 
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3.8.2.2 On the RTX 2001A 


On the RTX 2001A, the NMI input has a glitch filter circuit which 
requires that the signal that initiates the NMI must last at least two cycles 
of ICLK. | | 


The NMI can cause the processor to perform an Interrupt Acknowledge 
cycle in the middle of such operations as Step Math instructions, Streamed 
instructions, and other operations that could result in the loss of data or 
misoperation of the hardware if interrupted. For this reason, a “Return 
From Subroutine" should not be performed from the NMI service routine. 
Instead, the NMI handler should re-initialize the system. 


3.8.2.3 On the RTX 2010 


On the RTX 2010, the NMI has two modes of operation which are 
controlled by the NMI MODE Flag (bit 11 of the CR). 


When CR bit 11 is cleared (=0), the NMI cannot be masked and can 
interrupt any cycle. This allows a fast response to the NMI, but does not 
guarantee that a Return From Interrupt will always provide correct 
operation. The NMI MODE Flag is cleared at Reset. 


When the NMI_ MODE bit is set (=1), the NMI may be inhibited by the 
processor during certain critical operations, and further NMIs and 
maskable interrupts are disabled until the NMI Interrupt Service Routine 
has been completed and a return has been executed. In this mode, a 
return from the NMI Interrupt Service Routine will allow the processor 
to resume correct execution at the point where it was interrupted. 
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4 RTX Registers 


The three types of registers which the RTX microcontrollers use are: Stack 
Related Registers, Internal Processor Registers, and Status/Control Registers. 


At power up or Reset, the RTX registers are initialized. The reset states for the 
RTX 2000 are shown in Table 4.1. The reset states for the RTX 2001A are 
shown in Table 4.2. The reset states for the RTX 2010 are shown in Table 4.3. 
In each of these tables, the read and write capabilities of each register are 
indicated in the R/W column, where: 7 


R-W Indicates that the register can be either read from or written to. 

R | Indicates a read-only register. 

WwW Indicates a write-only register. 

R/W _Indicates that the first register is read-only and the second register 


is write-only (as in the case of the cous and Timer 
Preload Registers). 3 


* Indicates that individual bits in the register may be read-only or 
write-only and that the bit map for that register should be consulted. 


N Indicates that the register cannot be read from or written to. 


Register addresses are given in hexadecimal, denoted by "A here and elsewhere 
in this manual. 


The sections which follow describe each of the registers in more detail. 
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TABLE 4.1; RTX 2000 REGISTER INITIALIZATION 


\ 


INITIALIZATION 
REGISTER [ADDR VALUES 


=| aa 


at 
02H 


0100 0000 0000 1000. Interrupts disabled, BOOT=1, Byte 
Order=0 


mow [earn [ 
boll heercanc! Silt aa. 
FoI (idl Sells ell A 
Kies CJD Kecalbsentoestad lal et screamer 
(ic cll Nerchssonendiacs) Liat Mi dict enc 
SER [op fasts tity tity sity] we | Linit for each stack set to 255 


0000 0010 0000 0000 Read only; initialized to 
“No yanterrupt Value" 


bias Dredd Gocbeendemedaved Lied Kacksnaskaeticondonok a 
FE a) a a 
| [UPR ——_| oe [0000 0000 6000 0000] R-w initialize for User Page 0 =. | 
Ue SIR (ecard bal nce ie ca 


0000 0000 0000 0000 iAtelnt Basex0, Counters on 
internal clocks, no rounding, 
use CFR for data accesses 
bl Kl Cahn Ll clan 


0000 0000 0000 0000 Alt Timer/Counters set to 
time-out after 65536 counts 


mata 1111 1111 0000 0000 PR | Read! only; Mult. Low Product 
a oe 
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TABLE 4.2: RTX 2001A REGISTER INITIALIZATION 


INITIALIZATION 
REGISTER j|ADDR VALUES R/W | COMMENTS 


Ee 
Cen RE 0) ee ee 


00H | 
O1n 
02H 
| ~ T0100 oe 0000 1000 Interrupts disabled, BOOT=1, 
Byte Order=0 
IMR =| 08H 0000 0000 0000 0000 Rw ALL interrupts unmasked 


sox [om | 0000 0011 0000 0011 Row | Stack underflow limits set 


Write only; each stack overflow 
limit set for max. stack size 
0000 0010 0000 0000 Read only; Interrupt Vector init- 
ei jialized to "No Interrupt” value 
IPR | oce | 0000 0000 0000 0000 Rd Initialized for Code Page 0 
[DPR | 00H 0000 0000 0000 0000 Row Initialized for Date Page 0 


|UPR | EH 0000 0000 0000 0000 Rw Initialized for User Page 0 
}CFR | OFK | 0000 0000 0000 0000 RoW | Initialized for Code Page 0 


anne a 0000 0000 0000 0000 Interrupt Base=0, Counters on 


internal clocks, no rounding, 


use CPR for data accesses 
TCO/TPO 0000 0000 0000 0000; R/W | ALL Timer/Counters set to 
TCI/TP1 R/W | time-out after 65536 counts 
TC2/TP2 R/W 


2 Be BRIER PRINTS?» 
PS Sa reed 
oosecon S ees 


ee 1 one 
% irene "es 
es ORS 
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TABLE 4.3: RTX 2010 REGISTER INITIALIZATION 


INITIALIZATION 
REGISTER |ADDR VALUES 


ror | [ono wn worvooftw | 
Ret | [aera | 
Lan NE! LL i 


on 
02H 
0100 0000 0000 1000 | Interrupts disabled, BOOT=1, 
Byte Order=0 


Ec KR 
———— 
Em CA) 
[en [ nso vos omo cw [ AT Terns weno 
ex [om [oe so mio wee [sek var ree ee 
SY o_o aie [| ak earn nes vee 


aa onty: each stack overflow 
init set for mex. stack size 
0000 0010 0000, 0000 Reed only; Interrupt Vector init- | 
jalized to “No Interrupt" value 
PR | och 0000 0000 0000 0000 [re initialized for Code Page 0 
DPR | OOH 0000 0000 0000 0000 ita initialized for Data Page 0 


}UPR | OH | 0000 0000 0000 0000 Row | initialized for User Page 0 
[CPR | OFH | 0000 0000 0000 0000 Rew | initialized for Code Page 0 } 


0000 0000 0000 0000 Interrupt Basez0, Counters on 
internal clocks, no rounding, 
use CPR for data accesses 

UBR | 11H | 0000 0000 0000 0000 Rw | User Base address set to 0 ) 


0000 0000 0000 0000 MAC Extension Register; LZ2D 0 Count; 
Barrel Shifter Count 
TCO/TPO 0000 0000 0000 0000 | All Timer/Counters set to. “e 
TC1/TP1 time-out after 65536 counts 
TC2/TP2 | 


Loam 0000 0000 0000 0000 aw Mul tiplier and MAC Low Register | 


0000 9000 0000 0000 Multiplier, Barrel Shifter, and L2D | 
High Register; MAC Middle Register 
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4.1 Stack Related Registers 

These registers contain the top elements of the Parameter and Return Stacks, and 
are the implicit source and destination for many of the processor operations. 
4.1.1 TOP Register | (Parameter Stack) | 


~The TOP Register contains the top element of the paramere Stack, and has no 
ASIC address assignment. 


_ This is the primary working register for the processor, and is the implicit data 
source or destination for certain instructions. 


All ALU results are loaded into TOP. The output from TOP may be written to 
any ASIC Bus register and to external I/O devices. | 


4.1.2 NEXT Register (Parameter Stack) = : 


The NEXT Register contains the second element of the Parameter Stack, and has 
‘no ASIC address assignment. 


During arithmetic operations, this register holds the lower 16 bits of a 32-bit 
operand. _NEXT is also the source of data for all memory writes. 
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4.1.3 1 Register _ (Return Stack) 


The Index Register, I, can be accessed at three different ASIC addresses, and the 
choice of ASIC address determines the type of operation to be performed. 


As a Stack Related Register at ASIC addresses 00H (Hex) and 01H, I contains 
the lower 16 bits of the top element of the 21-bit wide Return Stack. IPR 
contains the other 5 bits. See Section 4.1. 4 for more details about IPR. 


The contents of I may be accessed in either push/pop mode, in which values are 
moved to/from Return Stack memory as. required, or in read/write mode in which 
the Return Stack is not affected. . 


In addition to its use in holding return address bits, at ASIC address 02H, this » 


register is also used to hold the count for streamed (repeated) instructions and 
loop instructions. Operation of I at this ASIC address is described in more detail 
in Section 4.1.3.3. I access operations and the associated addresses are shown 
in Table 4.4. 

TABLE 4.4: I ACCESS OPERATIONS 


PERATION | RETURN 
pclae BIT aporaas 
g-write) VALUE ggggg 


ee 
ee 
Tw 0 | 00000 | | Penet conmnis BS oD with pun ofthe Rem Stack) 
Fe es oe ene 

[a | 0 | e000 |_| Pantene cone onic IISE porn eReumSick 


00001 Paes te cota one IEG without popeng te Rtn Slack on 
executes the Subroutine Retum 


i SO 8 Pushes te contents cfg ro poneing te Paaete Sack | 
| 1 {| oor | go | Performs a Subroutine Retum, then pushes the contents of [i@Ig into ] 


00010 Puahes the contents of] shifted left by one bit, nto IES 
(the Retum Stack is not popped) 

00010 Pushes the contents of J shifted left by one bit, into [Xe]g (the Return 
Stack is not popped), then performs @ Subroutine Retum 

00010 Pushes the contents of [ieIy into [Jes a “stream” count, indicating that 
the next instruction is to be performed a specified number of times; 
the Parameter Stack is popped . 


| ow | 1 {| ooo | go | Balers s broire tn Uennisies te ween tine i 
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4.1.3.1 I | At Address 00H 


Location 00H is used to access I without causing any net pushes or pops of the 
Return Stack. . 


Reading from this location pushes the contents of I onto the Parameter Stack. 


‘Reading from this location as part of a subroutine return pushes the contents of 
I onto the Parameter Stack, then performs a Return-From-Subroutine. 


Writing to this location during normal operation pops the top item on the 
Parameter Stack into I; the original contents of I are lost. . 


Writing to this location as part of a subroutine return operation first executes the 
return, then pushes the top item of the Parameter Stack onto the Return Stack. 


Ses! “S v 
°, veteran Y & 

: 2 eve se one 
0,000. @, 2,2, OO v 
POE S ens 
4.1.3.2 I | At Address 01H a3 ee 

@ e e 6 ro RPK Ne XX 

se pee OS oO ae s 
& o. , 
x = . RED SOND 


I at address 01H is used to push and pop the Return Stack. 


Reading this location during normal operation pushes the contents of I onto the. 
Parameter Stack and pops the Return Stack. — | 


Reading this location as part of a subroutine return operation pushes the contents 
of I onto the Parameter Stack Vio.) or mo the Return Stack, then executes _— 
the subroutine return. See Sectio for more information cea come : 
during subroutine returns. . Gee aida 5. | 


Writing to this location during normal operation pushes the top item from the“ 
Parameter Stack onto the Return Stack, popping the Parameter Stack. 


Writing to this location as part of a subroutine return operation first executes the 


subroutine return, then pushes the top Parameter Stack item onto the Return 
Stack. See Section 5.1.1 for more information about subroutine return operation. 
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4.1.3.3 I | At Address 02H (Stream Count/Loop Count) 


Reading this location pushes the. contents of I shifted left by one bit onto the 
{ Parameter Stack. The Return Stack is not popped. 


© : os iw “\ Reading this location as part of a subroutine return pushes the contents of I 


GV. 5 t shifted left by one bit into TOP (the Return Stack is not popped), and then 
7 Pi W performs a Return-From-Subroutine. 
V ¥y Writing to this location aides normal operation pushes the top Parameter Stack 


item into I as a "stream" count, indicating that the next instruction is to be 
acspecified number of times} the Parameter Stack is popped. 


"= 24! Writing to this location as part of a subroutine return operation executes the 
= =< ))\ subroutine return first, then pushes the stream count onto the Return Stack. 


FOV pth Or 
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4.1.4 IPR Register (Return Stack) 


The IPR Register, at ASIC Address OCH (Hex), can be described as both a Stack 
Related Register and as a Control/Status Register. See Section 4.3.10 for more 
information. This register contains the 5 most significant bits of the top element 
of the Return Stack (the I Register contains the other 16 bits). 


Reading from or writing directly to IPR does not push or pop the Return Stack, 
but pushes or pops of the Return Stack (when reading or writing to I) do cause 
the contents of IPR to be overwritten. Writing to I during non-subroutine 
Operations causes the current Code Page value to be written to IPR. 


dak bi te 


RTX Registers, Chapter4¢ 43 


rw 


4.2 Internal Processor Registers 
Internal Processor Registers are not directly accessible to the programmer. 
The Instruction Register, IR, is actually a: latch which contains the instruction 


currently being executed. This register is loaded directly from main memory via 
an instruction fetch, and is not accessible under program control. 


The bits of the instruction in IR are decoded to determine which operations to 
perform, to determine the location of the next instruction to be executed, and to 
provide data for immediate operations. 
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4.3 Control/Status Registers 


The contents of the RTX microcontroller’s Control/Status Registers determine the 
Operating environment for the processor, and allow the processor to monitor and 
‘control the various uO devices on the chip. | 

7 
All internal registers are accessed through the ASIC Bus. ASIC addresses oe” ~ 
through 23 (17 hexadecimal) are assigned to on-chip registers and devices, and 
are described in this section. Section 7.7 describes the RTX instructions which 
access the ASIC Bus. 
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4.3.1 The Configuration Register - Address 03H - 
The Configuration Register, CR, controls the setup/status of the RTX processor. 


Reading this location pushes the current | ‘contents of the register onto the 
Parameter Stack. 


Writing to this location pops the top Parameter Stack item into CR, updating the 
control bits. The Interrupt Base/Control Register contains additional processor 
control bits. 


The bits in CR are assigned as shown in Table 4.5. 
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TABLE 4.5: CONFIGURATION REGISTER BIT ASSIGNMENTS 


Read-only; Interrupt Latch: When set to 1, 
indicates that an interrupt request is 
pending. See Section 6.2. 


Read-only; Read Interrupt Disable: Status 
of Interrupt Disable bit. When set to 1, 
indicates that interrupts are disabled. 
Resets to 1. Use SID bit to set value. 
See Section 6.2. 


Reserved on the RTX 2000 and RTX 2001A. 
On the RTX 2010: When this bit is set, the | 
PCLK cycle for every ASIC bus read is | 
extended. See Section 5.1 for more’ details. | 


Reserved on the RTX 2000 and RTX 2001A. 
On the RTX 2010: When this bit =1, return 
from a Non- Interrupt can be made 


See Section 3.5.2.3 for more informati n. 


Reserved 
Write- only, { lways | reads as. “zero; ee LRA 
Set Interrupt “Disable: When set to 1, 
processor will not respond to interrupts. 
RID bit contains true value of Interrupt 
Disable bit. See Section 6.2. 


R/W; BOOT: Controls BOOT output pin. May 
be used to select boot memory on power up. 


R/W; Byte Order: Controls order in which 
bytes of data will be read from or written 
to memory. See Section 5.22% 1 BS 


R/W;. Complex Carry: Carry bit from ALU 
extension. See Section 8. 3. 
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4.3.2 The Mp Register - Address 04H 


The MD Register is used to hold intermediate values during step math operations 
(see Chapter 8). It may also be used as a general purpose scratchpad register. 


Reading this location pushes the contents of the MD Register onto the Parameter 
Stack. | 


§ 


Writing to this location pops the top Parameter Stack item into the MD Register, 
replacing its previous contents. | 


4.3.2.1 Mp On The RTX 2000 and RTX 2010 


ee s On the RTX 2000, MD is the Multistep Divide Register. During 
Bos multistep divide operations, this register holds the divisor, while TOP and 
NEXT hold the 32-bit dividend. 


4.3.2.2 MD On The RTX 2001A 


On the RTX 2001A, MD is the Multiply/Divide Register. This register 
holds the divisor during Step divide operations (the 32-bit dividend is in 
TOP and NEXT). During step multiply operations, this register holds the 
saa ge while NEXT holds the multiplicand. 


Ths ae shy Hiply Se anki ne 


pry Ie 200d, = This 
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4.3.3 The SQ Register - Address 05H 
This address is a "pseudo-register" for step math operations (see Chapter 8). 
Reading this location reads the contents of the MD Register, shifts the result left | 


by one bit, then logically OR’s this value with the contents of the SR Register. 
The result is pushed onto the Parameter Stack. 


7 ap A | Writing to this location shifts the top Parameter Stack item left by 8 bits, then 
| pops this value into the MD Register. | 
‘i | - fF 2eVv0S shi Flea ante (pu? 
ov 7 | a b bits £ 


4.3.4 The SR Register - Address 06H 


-The Square Root Register is used to hold intermediate values during the Oa at 
_ calculation of equate roots. It may also be used as a general purpose scratchpad fee 
register. . ee 


Reading this location pushes the contents of the SR Register onto vifie Parameter 
Stack. 


Writing to this focation pops the top Parameter Stack item into the SR Register, 
replacing its previous contents. 
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4.3.5 The pc Register - Address 07H 


The Program Counter Register, PC, contains the lower 16 bits of the address of 


_ the instruction following the one currently executing. 


Reading this location pushes the contents of the PC (the address of the instruction HY L 
following the one which reads the PC) onto the Parameter Stack. mee | on 4 


address contained in the top Parameter Stack item; the Paramétér 
popped. Writing to this location as part of a subroutine return operation pushes 
the top Parameter Stack item onto the Return Stack, then executes the subroutine 
return; the Parameter Stack is popped. : | 


See Table 4.6 for PC Register access operations. 


TABLE 4.6: PC REGISTER aa OPERATIONS 


ae a A Pushes the contents of 28 ito ITEIE 
a Pushes the contents of 2d into IEE, then performs a Subroutine Retum 


Performs a Subroutine Cal! to the address contained in §I15 popping oer 

the Parameter Stack 

Pushes the contents of [X6Ig onto the Retum Stack betore executing 
: _the Subroutine Retum 
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4.3.6 The Interrupt Mask sailed - Address 08H 


The bits in the Interrupt Mask Register, IMR, cause individual interrupt request 
inputs to the Interrupt Controller to be enabled or disabled. When a bit is set to 
1, the corresponding input is masked (disabled). The IMR resets to all 0’s - all 

— Interrupts unmasked. Only NMI, the Non-Maskable Interrupt cannot be masked. | 


Reading this location pushes the current contents of the IMR onto the Parameter 
Stack. 


| Writing to this location pops the top Parameter Stack item into the IMR, updating © 
the mask values. See Table 4.7 for bit assignments. | 
TABLE 4.7: INTERRUPT MASK REGISTER BIT ASSIGNMENTS__. 


[res [nes sus ]exs]eve]ers]ralr1]rofera]esv[psv]asufpsulerses) iss 
aed Bits 14- 5 Reserved. Always read as 0; should 7 

be set =0 during Write operations. 
Ce Ce 
res [oie entre tat Pn 
ree [aie | exert pe rin 
pes [aie [exter im rins =| 
a 
rei [aie 6 | extermt iawn? 
[ee [aie 5 [astern sack ovrttow 
Prev [ aie «| Paramtar stack Overflow 
[ev [eie # | petarn sack unrfiow 
ee 
[er [wet [erent owaries 


Reserved. Always reads as 0. 
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4.3.7 The Stack Pointer Register, SPR - Address 09H 


This location contains the combined registers for the Parameter Stack Pointer and 
Return Stack Pointer, which are accessed together. Bits 0-7 contain the pointer 
for the Parameter Stack, bits 8-15 contain the pointer for the Return Stack. 


Reading this location pushes the contents of the register onto the Parameter 
Stack. The value read for the Parameter Stack pointer will reflect the Parameter 
Stack contents after the register value is pushed. 


Writing to this location pops the top Parameter Stack item into the Stack Pointer 
Register. 


4.3.8 Address OAH 


The assignment and utilization of this Priete is different for the RTX 2000, 
RTX 2001A, and RTX 2010 Microcontrollers. 


i 4.3.8.1 On The RTX 2000 
This location is reserved on the RTX 2000. 


4.3.8.2 On The RTX 2001A and RTX 2010 


On the RTX 2001A and RTX 2010, this address is used for the Stack 
Underflow Limit Register, SUR. This register holds the underflow limit 
values for the Parameter Stack and the Return Stack, which must be 
accessed together. 


This register can be utilized to define the use of substacks for both 
stacks. See Section 6.1.3 for more stack/substack configuration 
information. 
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4.3.9 Address 0BH: IVR, SVR, And SLR 


This address serves as two registers, and may be utilized by either the Interrupt 
Controller or the Stack Controllers, depending on whether a read operation or 
a write operation is being performed. 


In the read-only mode, this is the Interrupt Vector Register on all RTX 2000 
Family Microcontrollers, and is used to hold the current Interrupt Vector value. 
This register is initialized to the "No Interrupt" value. Reading this location 
pushes the value of the current vector being generated by the Interrupt Controller 
onto the Parameter Stack and clears any pending Timer/Counter interrupts. 


In the write-only mode, this address is utilized for stack limit operations by the 
Stack Controller. The specific function of this address differs depending on 
which processor is being used. 


4.3.9.1 Write-only On The RTX 2000: SLR 


In the write-only mode, this address is used as the Stack Limit Register. 
At Reset, this register is set to its maximum value of 255. 7 


Writing to this location loads new values into the Parameter Stack and 
Return Stack Limit Registers. Bit 0-7 are assigned to the Parameter 
Stack, bits 8-15 to the Return Stack, and both are accessed together. 


| 4.3.9.2 Write-only On The RTX 2001A: SVR 


In the write-only mode, this address is used for the Stack Overflow Limit 
Register and holds the overflow limits for the Parameter Stack and the 
Return Stack. These limits must be accessed together. The maximum 
overflow limit value for each stack on the RTX 2001A is 64. 
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4.3.9.3 Write-only On The RTX 2010: SVR 


In the write-only mode, this address is used for the Stack Overflow Limit 
Register and holds the overflow limits for the Parameter Stack and the 
Return Stack. These limits must be accessed together. The maximum 
overflow limit value for each stack on the RTX 2010 is 256. 


Bocce 
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4.3.10 Index Page Register - Address 0CH 


This 5-bit register contains bits 16-20 of the top item of the Return Stack. Bits 
0-3 of the Index Page Register, (IPR), contain the contents of the Code Page 
Register at the time the current subroutine was called (i.e., the memory page 
number to which the processor will return when execution of the current 
subroutine has been completed. Bit 4 contains the value of the Data Page 
Register Select Bit (DPRSEL) at the time the current subroutine was called. See 
Figure 4.1 and Section 5.2.2. 
Bit Assignments During Subroutine Operations 


20;191817,16115:141312111091817)6,5;413;2,1,0) 
: 


-—— » Type of Retum 


interrupt Returne: 
BROs 1 
Subroutine Retums: 
Defines Retum BkO= 0 
Addrees 
Where DPRSEL Bit is 


stored during interrupt 
or Subroutine Call 


BR Assignments During Non - Subroutine Operations 
IPR 


| Scenes? Sane 
unc pitinawumamEnEEnnetmesneneseene? 


hens Si6 Sey) ae 


{indeterminate _/ ne 


FIGURE 4.1: RETURN STACK BIT ASSIGNMENTS 


The Index Page Register provides a mechanism to access the upper bits of the 
subroutine return address. Reads and writes to the IPR do not pop or push the 
Return Stack. However, operations which push and pop the Return Stack do sit; 
overwrite the contents of IPR. These wo the Inder Re $ subroutine calls, 


subroutine returns, and reads and writes to the Index Register at_ ASIC Bus-—--~~ 


_, addresses 01H and 02H. Fy yah aya OOK S 


Reading this location pushes the contents of the IPR onto the Paramieter-Stack- er, er 


Writing to this location loads a new 5-bit value into the IPR. This operation 
should be used with caution, because it will change the subroutine return address. 


% 
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4.3.11 Data Page Register, DPR - Address 0DH 


When the DPRSEL bit (bit 5 of the IBC Register) is set =1, this 4bit register 


reference instructions. See Sections 4.3.14 and 5.2.1. 


av des ~seensains the number of the memory page which will be accessed by memory 


4.3. 12 User Page Register, UPR - Address OER 


This 4-bit register contains the number of the memory page whieh will be 


Instructions in a Ts 


-accessed by User Memory Space Instructions. See set Memory Access 


Ei . 
2 433. 13 Code Page Register, cpr - Address 0FH 


This register contains the number of the merhory page which will be accessed by 


56 


all instruction fetch cycles. Additionally, if the DPRSEL bit is set =0, this — 
i ontain the number. for the memory Page to be. accessed a memory 
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4.3.14 Interrupt Base/Control Register - Address 10H 


The bits in this register control special processor setup and configuration values. — 
See Table 4.8 for the IBC Register bit assignments.. See Section 4.3.1 for — 
information about additional control/status bits in CR. 


TABLE 4.8: IBC REGISTER BIT ASSIGNMENTS 


[sin nl [fof pwr fee] = [= [= [=] em 


1B0-1B5 Bits 


‘Interrupt Vector Base Address: Provides bits 10-15 of 
Interrupt vector generated by the Interrupt Controller 
during an INTA cycle. See Section 6.2. 


Timer Clock Select: Determine the source for the input | 


clock signals for the 3 Counter/Timers. See Sec. €. 


CYCEXT on the RTX 2000 and 2001A: When =1, extends bus 
cycle by 1 PCLK period for every INTA cycle or User 
Memory Instruction cycle. See Sec. 5.1.1 and 5.1.2. 
CYCEXT on the RTX 2010: Allows extended cycle length | 
tor User Memory Instruction cycles. See Sec. 5.1. 


7 


On the RTX 2000 and RTX 2010: ROUND option; when set 
to 1, the least significant 16 bits of the multiplier 
output are rounded into the most significant 16 bits. 
See Section 6.3. 

On the RTX 2001A: Reserved; should be set to 0 during 
write operations. 


Data Page Register Select: Determines whether source 
of bits 16-19 of Memory Address Bus are from CPR or 
DPR for memory access instructions. See Sec. 5.2.2. 


On the RTX 2000, these bits are reserved and should be 
0-4 set to 0. 

On the RTX 2010 and RTX 2001A, these are used as read- 
only stack controller flags where: 


Bit 0 Fatal Stack Error Flag; 

Bit 1 Parameter Stack Underflow Flag; 
Bit 2 Return Stack Underflow Flag; 
Bit 3 Parameter Stack Overflow Flag; 
Bit 4 Return Stack Overflow Flag. 
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4.3.15 User Base Register, UBR - Address 11H 
The contents of this register point to the beginning of a 32 word memory block 


which will be used for all User Memory Access instructions. See Section 5.2.3 
for information about User Memory Space. 


4.3.16 Address 12H 
The function of this address is determined by the RTX processor being used. 


4.3.16.1 On The RTX 2000 and RTX 2001A 
This location is reserved on the RTX 2000 and RTX 2001A. 


4.3.16.2 On The RTX 2010: 


The MAC Extension Register, MXR, is a 16-bit read/write register which 
holds the most significant 16 bits of the MAC Accumulator. For the 

Barrel Shifter instructions, this register holds the shift count. For the 
Leading Zero Detector instructions, the leading zero count is stored in 
this register. | 
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4.3.17 Timer/Counter 0 - Address 13H 


Reading this location pushes the current contents of Timer/Counter 0 onto the 
Parameter Stack. See Section 6.4 for more information about Timer/Counters. 


Writing to this location loads the pre-load register for Timer/Counter 0. 


4.3.18 Timer/Counter 1 - Address 14H 


Reading this location pushes the current contents of Timer/Counter 1 onto the 
Parameter Stack. See Section 6.4 for more information about Timer/Counters. 


Writing to this location loads the pre-load register for Timer/Counter 1. 


4.3.19 Timer/Counter 2 - Address 15H 


Reading this location pushes the current contents of Timer/Counter 2 onto the 
Parameter Stack. See Section 6.4 for more information about Timer/Counters. 


Writing to this location loads the pre-load register for Timer/Counter 2. 
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4.3.20 Address 16H 


Operations using this address depend upon | ietict the RTX 2000, RTX 2010, 
or the RTX 2001A Microcontroller is being used. — 


4.3.20.1 RTX 2000 - MLR 


On the RTX 2000, this address is the Multiplier Low Register, MLR, and / 
is used with the RTX 2000 on-chip hardware multiplier. 


Reading this location pushes the lower 16 bits of the multiplier output 
onto the Parameter Stack. The contents of TOP are pushed into NEXT, 
but NEXT is not pushed onto the stack. 


The MLR Register is a read-only = on the RTX 2000. 


4.3.20.2 RTX 2001A - rx, Scratchpad/Counting Register 


On the RTX 2001A, this address is the RX Register. The RX Register 
is a general purpose Read/Write. scratch pad register. Special 
ee instructions are available to increment or decrement RX in one cycle. 
24 E This allows the RX register to be easily utilized as a 16-bit program — 
— controlled counting register. 


Incrementing the register contents beyond the "all ones" state results in 
a wrap to the “all zeros" state. Decrementing the register below the “all 
zeros" State results in a wrap to the “all' ones" state. 7 


4.3.20.3 RTX 2010 - MLR 


On the RTX 2010, this address is for the Multiplier Low Register, MLR, 
and holds the least significant 16 bits of the 32-bit product generated by 
the on-chip hardware multiplier. This register is also used to hold the 
least significant 16 bits of the MAC Accumulator, and Barrel Shifter. 
See Section 6.3.2 for information about the Multiplier/Accumulator. 


The MLR can be read or written on the RTX 2010. 
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4.3.21 Address 17H 


Operations at this address depend upon whether the RTX 2000 Microcontroller 
or the RTX 2001A Microcontroller is being used. 


| 4.3.21.1  RTX 2000 - mur 


On the RTX 2000, the Multiplier High Register, MHR, is used with the 
on-chip hardware multiplier. 


Reading this location pushes the upper 16 bits of the multiplier output 
onto the Parameter Stack. The contents of TOP are pushed into NEXT, 
but NEXT is not pushed onto the stack. The MHR Register is a read-only 
register on the RTX 2000. 


4.3.1.2 RTX 2001A - RH 
On the RTX 2001A, this address is for the RH Register. This is a 16-bit 


scratchpad register for data storage, which provides faster access than 
access to memory or a location buried in the stack. 


4.3.21.3 RTX 2010 - MarR 


‘On the RTX 2010, the Multiplier High Register, MHR, holds the most 
Significant 16 bits of the 32-bit product generated by the on-chip 
hardware multiplier. If the IBC Register’s ROUND bit is set, this 
register contains the rounded 16-bit output of the multiplier. In the 
Accumulator context, this register holds the middle 16 bits of the MAC, 

or the most significant 16 bits of the Barrel Shifter. See Section 6.3.2 
for information about the Multiplier/Accumulator. 
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5 External Bus Interfaces 


Addresses for access to external memory or ASIC devices are output via either 
the Memory Address Bus (MA19-MA01) or the ASIC Address Bus (GAQ2- 
GAO0). | | 7 a 


External data is transferred by the ASIC Data Bus (GD15-GD00) and the fa) 
Memory Data Bus (MD15-MD00), which are both bidirectional buses. ae 


% 
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5.1 ASIC Bus Interface 


The ASIC Bus services both internal processor core registers and the on-chip 
peripheral registers, and eight external off-chip ASIC Bus locations. 


All ASIC Bus operations require a single cycle to execute and transfer a full 16- 
bit word of data. The external ASIC Bus maps into the last eight locations of the 
32 location ASIC Address Space. The three least significant bits of the address 
are available as the ASIC Address Bus. See Table 5.1 for the address map. 


ee 5.1: ASIC BUS MAP 


| __ ASIC BUS SIGNAL | 


GA02 | GA00 |GA00 


ASIC ADDRESS 
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5.1.1 RTX 2000 and RTX 2001A Extended Cycle Operation 


On the RTX 2000 and RTX 2001A, bus cycle timing can be extended by 
1 PCLK period to allow the use of some slow memory devices without 
requiring the addition of external Wait states. When the CYCEXT bit (IBC 
bit 7) is set equal to 1, extended cycles are used for all User Memory and 
Interrupt Acknowledge cycles. 
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5.1.2 RTX 2010 Extended Cycle Operation 


On the RTX 2010, the user has the option of independently extending bus 

cycle operations by 1 PCLK period for either User Memory Cycles or for 

ie oe ASIC Bus Read operations. This provides the ability to interface to some 

. é peripherals and slow memory devices without using externally generated 
er Wait states. : 


Setting the Cycle Extend bit (CYCEXT), bit 7 of the IBC Register, will 
cause extended cycles to be used for all accesses to User memory. 


Setting the ASIC Read Cycle Extend bit (ARCE), bit 13 of the CR Register, 
will cause extended cycles to be used for all Read accesses on the external 
ASIC Bus. : 


Both the CYCEXT bit and the ARCE bit are cleared on Reset. 
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5.2 Memory I Interface 


The RTX processors directly address 512K woul: of memory, divided into 16 
pages of 32K words each. | 


The memory page currently being addressed is ‘selected by one of three 4-bit 
“address page" registers, depending on what type of memory access is being 
performed. 


The RTX addresses 3 types of memory space, each with an associated address 
page register. These are Code space, Data space, and User space. 


¢ Code Memory Space is accessed by all instruction fetch operations. 
See Section 5.2.1. 


e Data Memory Space refers to all memory locations accessed by 
memory reference instructions. See Section 5.2.2. 


© User Memory Space provides efficient access to a block of 32 words 
_ which may reside anywhere in the processor’s memory space. See 
Section 5.2.3. 


The RTX instruction set includes classes of instructions for referencing each type 
of memory space. With the exception of instruction fetches and streamed MAC 
operations, RTX memory accesses involve the TOP and NEXT registers. 


The TOP register contains the address of the memory location-to be read or 
written. The NEXT Register interfaces to the Memory Data Bus. For memory ~ 
writes, the value contained in NEXT is written to the location addressed by the 
contents of TOP. For memory reads, the contents of the memory location 
addressed by the contents of TOP are loaded into NEXT, then the stack is popped, 
dropping the address and leaving the memory data in TOP. 
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The RTX’s memory reference instructions have various forms which determine 
the net stack effect of the memory read or write. Depending on the instruction 
format, the contents of TOP and NEXT may be overwritten by memory data, 

preserved on the stack, or modified through ALU operations. 


‘The RTX’s 20-bit Memory Address Bus is composed of the 16-bit address from. 
a the TOP register, and 4 bits from the appropriate address page register. 


The Code Page Register is used for all references to Code memory space, and 
the Data Page or Code Page Register for all references to Data Space. The Code 
and Data Page Registers may point to the same memory page, as in a system 
containing all RAM memory, or to different pages, as in a system with mixed 
ROM and RAM. Additionally, the CPR and DPR may point to the same page for 
small RAM/ROM systems. User Space addresses are a special case, and are 
discussed in Section 5.2.3. | 


The page address registers may be read or written by using ASIC Bus access 
instructions (see Chapter 7). The registers may be read at any time to determine 
the current active memory pages. : 
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5.2.1 Code Memory Space 


Code memory space contains machine instructions to be executed by the RTX eee 
processor. re cot 


5.2.1.1 Subroutine Calls and Returns 


RTX subroutine calls take place within the memory page specified by the Code 
Page Register. Any instruction with Bit 15 (the most significant bit) set to 0 will 
cause a subroutine call to the address contained in the lower 15 bits of the 
instruction. The address to be called is calculated by shifting the value contained | 
in the instruction left by one bit and inserting a zero in the least significant bit. 
For example, the machine instruction 3211H (Hex) will cause a subroutine call 
to location 6422H. See Table 5.2. | 


Long Calls may be made to a memory page other than the current Code page by 
first loading the appropriate page number into the Code Page Register, then 
executing the subroutine call. 


Loading a value into the Code Page Register performs two special functions. 
First, the effect of loading the Code Page Register is delayed by one instruction, 
so that the instruction following the load instruction is fetched from the current 
code page. Second, interrupts are disabled for one clock cycle following the load 
instruction. This guarantees that the instruction following the load (typically the 
Call instruction) will be executed without an intervening Interrupt Service 
Routine which might corrupt the contents of the registers. 


Subroutine calls save their 21-bit return address on the Return Stack ( 
element is composed of the Index Register and the Index Page Sa ee 
subsequent calls occur, the storing of subroutine return addresses in IPR ae ; 
causes the previous contents of IPR and I to be pushed into the Return Stack. 

The 21-bit return address is contained in the Index Register and the Index Page 
Register and consists of the following: 
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Index Register (I), at address 01H 


¢ BitO Set to 1 if call results from an interrupt acknowledge, 0. 
| otherwise. As indicated by "i" in Figure 5.2. 


© Bits 1-15 Word address to which to return (bits 1-15 of ee 
Counter). The least significant bit of the return address is — 
implicitly 0 since instructions are always fetched on word 

boundaries. —— 


Index Page Register (IPR) 
° Bits 0-3. Code page to hich to return 


e Bit4 — Valueof DPRSEL bit ne — of Data Memory). As 
indicated by "D" in Figure D2: 


IPR and I 
Registers 
(current 

return address) 


| | Top stack memory 
Program Counter element of Return 
Stack (previous 
return address) 


| | Second stack 
_ Program Counter | memory element 
: | (etc.) 


FIGURE 5.2: RETURN STACK STRUCTURE. 
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Example: Code executing at location 1220H in Code Page 3 calls a 
subroutine located at address 3322H in Code Page 4. See Table 5.2. 


TABLE 5.2: SAMPLE SUBROUTINE CALL AND RETURN 


Actual 
Address Address 
Cycle Code Page (PC) (MA19-MA01) Instruction 
1 3 1220 31220 Set CPR = 4 
2 3 1222 31222 Call location 3322H 
3 4 3322 43322 1st subroutine instruction 
4 4 3324 43324 2nd instruction 
5 4 3326 43326 Return From Subroutine 
6 3 1224 31124 Ist instruction after Call 


5.2.1.2 Branching 


Branching instructions work similarly to subroutine calls. Branches may be © 
performed across page boundaries by first loading the Code Page Register with 
the new page number (the current page plus 1 for forward branches, the current 
page minus one for backward branches). See Chapter 7 for specific details on 


branch instructions. 


If the instruction following a "Load Code Page Register" instruction is not a Call 
or Branch, it will be executed, then the next instruction will be fetched from the 
memory page specified by the new contents of the Code Page Register. 
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Data Memory refers to memory locations waceused by the RTX’s "Data Memory | 
Access" class of instructions. These would typically be RAM locations used for 
variables and data storage. 


5. 2.2.1 Memory Page Selection 


The memory page referenced by data memory instructions may be selected by 
either the Data Page Register or the Code Page Register. The DPRSEL bit (Data. 
Page Register Select bit, IBC bit 5) determines which register will be used. 


When DPRSEL = 0, all 1 main memory accesses will occur in the memory page 

addressed by the Code Page Register. This is the default mode. In this mode, 
code and data memory spaces to reside in the same memory page, and would 
typically be used in systems with 64K bytes or less of memory. In such a 
system, the Memory Address Bus bits generated by the page select logic os 7 
MA19-MA16) would not be required. 


When DPRSEL = 1, all main memory accesses will occur in the memory page | 
addressed by the Data Page Register. The Data Page Register may point to the 
Same page as the Code Page Register, or to. a Separate page. 


The state of the DPRSEL bit may be controlled through three methods. First, 
DPRSEL is directly readable and can be set as a bit in the Interrupt Base/Control - 
register. Second, it may be set or reset in one clock cycle by special forms of 
the Register read/write instructions pertaining to the Data Page Register (see 
"Predefined ASIC Bus Instructions" in Chapter 7). Third, DPRSEL is saved as — 
bit 4 of the Index Page Register during subroutine calls. The value in IPR may _ 
be modified by a subroutine; the new value will be written into the DPRSEL bit — 
and take effect as soon as a Subroutine Return instruction is executed. 
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5.2.2.2 -Memory Access Mode Selection 


To support the use of shared memory interfaces with other processors, the RTX 
can be configured to access Data Memory in either of two modes which 
determine whether the byte order in memory will be High-Low (Mode 0) which 
is the default mode, or Low-High (Mode 1). Bit 2 of the Contiguranon Register 
| is used to select the Data Memory Access Mode. 


The default, Mode 0, is selected: when CR bit 2 = 0. This means that the most 
Significant byte of data in the processor register (NEXT) will be read from or 
written to the even byte address in memory, and the least significant byte of data 
in NEXT will be read from or written to the odd byte address in memory. 


Mode 1 is selected when CR bit 2 = 1. This means that the most significant 
byte of data in NEXT will be read from or written to the odd byte address in 
memory, and the least significant byte of data in NEXT will be read from or 
written to the even byte address in memory. See Figure 5.3. 


MODE 0 - MODE 1 


: . —, Data Byte 
MS BYTE LS BYTE MS BYTE LS BYTE Order 
: —— . in NEXT a bs 


: Data Byte 
MS BYTE LS BYTE | LS.BYTE BYTE MS | MS BYTE Order in 
| . et in 


| Even Address | Odd Address | - aE Even Address | Odd Address 
| _ FIGURE 5.3: MEMORY ACCESS MODES 


In addition to allowing selection of byte order, the RTX allows the user to 
choose between TaCCeSSInE Data Memory } in either 16-bit words or 8-bit bytes. 
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5.2.2.3 Memory Access Examples 


Byte reads from locations 1000H and 1001H will both read a byte from word 
address 1000H. CR bit 2 and bit 0 of TOP determine which byte of the memory 
location will be accessed. For 8-bit writes, only bits 0-7 of NEXT are transferred 
to memory. For 8-bit reads, data from memory is transferred into bits 0-7 of 
NEXT; bits 8-15 of NEXT are set to 0. 


Example: Reading and Writing a 16-bit value (1234H) to memory location 
1000H (all values are in hexadecimal) yields the results shown in Figure 5.6 at 
the end of the first cycle. 


Memory Read — 


CR bit 2=0 _ CR bit 2 = 1 
Memory 1000 1001 1000 1001 


TOP 
NEXT - 


Memory 1000 1001 | 1000 1001 


| 12 |. 34 14: | 34 | 12 | 
a Ces —— se 


FIGURE 5.6: 16-BIT READ/WRITE TO EVEN MEMORY ADDRESS 
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The least significant bit of the memory address contained in TOP may also be fee 


used to control the Byte-swapping feature. If the LSB of TOP is 1 when 
performing a 16-bit memory access, then an odd byte address is being accessed. 
This means that the same word address will be read or written, but the bytes of 
the word read or written to memory will be swapped. 


Accessmy a word with the LSB of the address set to 1 effectively inverts the Byte 
Order bit. 


Example: Reading and Writing a 16-bit value (1234H) to memory location 
1001H (all values are in hexadecimal) yields the results shown in Figure 5.7 at 
the end of the first cycle. 


Memory Read 


CR bit 2 = 0 CR bit 2 = 1 
Memory 1000 1001 1000 1001 


RJ 


NEXT 3412 


| nares 


Memory 1000 1001 1000 1001. 


[34] 12 | [12 | 3s | 
| ees Cees | a: ene | 


FIGURE 5.7: 16-BIT READ/WRITE TO ODD MEMORY ADDRESS 
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The Byte Order bit also affects 8-bit memory accesses. If the Byte Order bit is 
set to 1, the LSB of the address contained in TOP is inverted before pertorming 
the memory read or write. Following are two examples. 


Soh is | | 
=<=8 4 . Example: Reading and Writing an 8-bit valge to memory location 1000H yields 


OE the the results shown in — 5.8 at the end of the first cycle: 


| Memory Read : 
CR bit 2 = 0 CR bit 2=1 
Memory 1000 1001 , 1000 1001 


NEXT 0012 


| Se eee | 


Memory Write 


CR bit 2=0 CR bit 2 = 


= not changed 
1000 1001 


FIGURE 5.8: 8-BIT TO EVEN MEMORY ADDRESS 
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Example: Reading and Writing an 8-bit value to memory location 1001H yields 
the results shown in Figure 5.9 at the end of the first cycle. 
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Memory Read 


CR bit 2 = 0 CR bit 2 = 1 
Memory 1000 1001 1000 1001 


re 0 
NEXT 0034 


Ceca 


Memory Write 


CR bit 2 =0 CR bit 2 = 


EF 


1000 1001 1000 1001 


FIGURE 5.9: 8-BIT READ/WRITE TO ODD MEMORY ADDRESS 
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5.2.3 User Memory Space 


User Memory space is a block of 32 words which the RTX processor can access 
without having to first calculate an address and load it into TOP. The logical 
address to be referenced within the 32-word block is embedded in the machine 
instruction which accesses the memory location. 


User Memory space would typically be ‘iked to hold data which must be accessed | 
frequently, such as system parameters or context save areas in a multi-tasking 
system. See Chapter 7 for descriptions of the User Memory Reference 
Instructions. 


The physical address to be accessed when addressing User Memory space is . 
_ derived from several components, shown in Figure 5.10. 


The User Page Register (UPR bits 03-00) points to then memory page to be used 
for User Memory Access. The User Base Address Register (UBR bits 15-06) 
contains the offset. for the particular 32-word block to be accessed by User 
Memory Instructions. The exact word in the 32-word User block to be accessed 
is specified by the address contained in the lower > bits of the User Access 
Instruction. 7 7 5 


As indicated in Figure 5. 10, bits 05-01 of t the UBR Register are logically OR’ed 
with the 5 address bits embedded i in the User Access Instruction (IR bits 04-00), 
and the results yield the next five memory address bits (MA05-MA01). Because 
of the logical OR which takes place, only the ten most significant bits of UBR — 
should be used to specify the User Base Address, since setting the lower bits will 
have the effect of reducing the user block size by duplicating addresses. 


Finally, since User accesses are always on: word boundaries, bit 0 of the UBR 
should always be zero. 


Table 5.3 provides some samples of addresses, as. determined by the contents of 
UPR, UBR, and IR. 
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USER PAGE 
REGISTER [1514131211109 18|7/61514/3;2/ 110) 
ATS 


RESERVED* 


REGISTER 


aS 
NOT 


BEN ha a 
THIS ADDRESS 


"instruction (iatsiaitid 8 6]7 6's 4]3 "27170 
REGISTER Aca | RReRENNED: 


FIGURE 5.10: USER MEMORY ADDRESS COMPONENTS 
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TABLE 5.3: USER MEMORY ADDRESS EXAMPLES 


UPR (4 bits) UBR (10 bits) Address field (5 bits) Actual Address (20 bits) 


from IR 
OH 1240H | 3H (01246H 
2H 3140H OFH | | 23 15EH 
2H 33101 1FH | 2333EH 


Note that in the third example some jocations within the 32-word block will not 
be accessible because bit 4 in the User Base Register is set to 1 and will cause 
the corresponding bit of the address to always be set due to the OR operation. 


By adjusting the contents of the User Page Register and User Base Register, an 
application may have any number of 32-word User spaces (up to 1 megabyte). 


The byte-swapping operations described for the Data Memory accesses do not 
affect User memory accesses. 
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6 On-Chip Peripherals 


The RTX 2000 Series microcontrollers contain hardware to support many of the | 
functions typically needed in real-time control systems. These include two Stack 
Controllers, an Interrupt Controller, and three 16-bit Counter/Timers. 


In addition, the RTX 2000 offers an on-chip 16-by-16 Hardware 
Multiplier, while the RTX 2010 offers an on-chip Multiplier/ 
Accumulator, Leading Zero. Detector, 32-bit Barrel Shifter, 
hardware floating point support, and multi-tasking stack support. 


All on-chip peripheral devices are accessible through the ASIC Bus by the use 
of ASIC Bus Read and Write instructions. The contents of the TOP register may 
be written to the devices, and the outputs of the devices may be read through the 
ALU into the TOP register. 


This section contains the information necessary for programming the On-Chip 
Peripheral devices. Refer to Chapter 4 for more information about the ASIC Bus 
addresses for each device. 
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6.1 Stack Controllers 


Each RTX Microcontroller contains two identical stack controller circuits, one 
for the Parameter Stack, and one for the Return Stack. The RTX Stack 
Controllers utilize stack pointers and stack limits for control of stack operations. 
Specific details of how the stack controllers work are determined by the type of 
processor being used. ‘eS | 


On the RTX 2000, operation of the Programmable Stack Controllers 
depends on the contents of two registers, the Stack Pointer Register 
(SPR), and the Stack Limit Register (SLR). 


On the RTX 2001A, operation of the Programmable Stack 
Controllers depends on the contents of three registers. These 
registers are the Stack Pointer Register (SPR), the Stack Overflow 
Limit Register (SVR), and the Stack Underflow Limit Register 
(SUR). 


On the RTX 2010, operation of the Programmable Stack Controllers 
depends on the contents of three registers. These registers are the 
Stack Pointer Register (SPR), the Stack Overflow Limit Register 
(SVR), and the Stack Underflow Limit Register (SUR). To use these - 
registers to perform Multitasking operations, see Section 6.1.3.2. 
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6.1.1 Stack Pointer Operation 


The Stack Pointers for both stacks are combined into one 16-bit register for fe 
access through the ASIC Bus. This register may be used to read and write both fa 
stack pointers in parallel. The stack pointers are used to determine the "top" Fe 2m. 
location in stack memory for each stack. , oes 
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6.1.1.1 Stack Pointers For the RTX 2000 


On the RTX 2000, the value for each stack pointer is initialized to 
a value of 0 at reset, and can range from 0 to 255. Each stack 
pointer indicates the position of the "top" item in stack memory, 
which contains the data that was most recently pushed into the 
stack. See Figure 6.1.. ee 


es Ss?) a a 
151419121910 9/8 |7;6;5) 4,3. 2; 1,0) 


151413121110 9,8; 7,6, 5)4,3)2) 1,0 


satiate estennettaranteteattebae ataueen netomat eee 


FIGURE 6.1: RTX 2000 STACK CONTROL 
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The Stack Pointer Weicad is at ASIC address 09H, and may be 
used to read and write both stack pointers. Bits 0-7 contain the 
stack pointer value for the Parameter Stack, while bits 8-15 contain 


2 the pointer value for the Return Stack (see Figure 6.2). 


151413121410 9 | I CIIICIETET 
Sesser ese a ctmmmmmcmmsmimsensredl raters at fend ears 


PSP, Parameter 
Stack Pointer 


>» RSP, Retum 
Stack Pointer 


FIGURE 6.2: RTX : 2000 STACK . REGISTER 


During a stack push operation, the SPR 1S incremented by 1 before 
the new item is pushed onto the stack. (i.e., when the operation 
begins, the register contains the address of the next stack location 
to be written for each stack). The Stack Pointer may be set to a 
new value by writing to SPR; the value written to the register should 
be one less than the address of the first location to be written. 


During a stack Read operation, the pointer indicates the next item 
which can be popped from the stack memory. After that item has 
been popped, the stack pointer is decremented by 1. Since reading 
the stack pointer pushes a value onto the Parameter Stack, the value 
read will be 2 more than the number of items on the Parameter 
Stack prior to reading the Stack Pointer Register. 


Stack Underflow on the RTX 2000 - The SPR monitors the total 

number of items on the stacks, and will generate a "stack . 
underflow" interrupt request if moré items are popped from the 

stack than were pushed onto it. The underflow signals are fed to 

the Interrupt Controller (see Section 6.2) and may be masked 

through the Interrupt Mask Register (IMR). 
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6.1.1.2 RTX 2001A and RTX 2010 Stack Pointers 


On the RTX 2001A and RTX 2010, the value for each stack pointer _ 
is initialized to a value of 0 at reset. On the RTX 2001A the stack 
pointer values can range from 0 to 63; on the RTX 2010 they can 
range from 0 to 256. 
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Each stack pointer indicates the position of the "top" item in stack 
memory, which contains the next stack element to be accessed in a 
stack write operation. After a stack write ("push") operation, the 
stack pointer is incremented. | | 


In a stack read operation, the stack memory location with an address 


one less than the pointer location will be accessed. After a stack 
~ read ("pop") the pointer is decremented. See Figure 6.3. 


STACK MEMORY 8 


‘FIGURE 6.3: RTX 2001A/2010 STACK CONTROL 
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- Reserved * 


RSP, Return Stack 
Pointer 


Reserved * 


Pointer 
FIGURE 6.4: RTX 2001A STACK POINTER REGISTER 


On the RTX 2010, bits 0-7 contain the stack pointer value for the 
Parameter Stack, while bits 8-15 contain the pointer value for the 
Return Stack. See Figure 6.4. 


15 4 13 12|1110)0 18/76 )514/3)2) 10) 
Se 


——— PSP, Parameter Stack 
. Pointer 


RSP, Return Stack 
Pointer 


On the RTX 2001A, bits 0-5 contain the stack pointer value for the 
Parameter Stack, while bits 8-13 contain the pointer value for the 
Return Stack. See Figure 6.4. 

| , - PSP, Parameter Stack 


FIGURE 6.5: RTX 2010 STACK POINTER REGISTER 
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6.1.2 Stack Limit Operation 


Stack limits are used to prevent data stored in the stack from being overwritten. 
Since the stacks wrap around, existing data on the stack will be overwritten by 
the new data when an overflow occurs. Underflows occur when an attempt is 
made to pop data off an empty stack, causing invalid data to be read from the 
stack. Since the processor can take up to four clock cycles to respond to an 
interrupt, the values set into the stack limit registers should include a safety 
margin which allows valid stack operation until the processor executes the 
interrupt service routine. 


On the RTX 2000, RTX 2010, and RTX 2001A, a buffer zone may 
be set up so that stack error interrupts are generated prior to an 
actual overflow. In addition, the RTX 2001A and RTX 2010 
Underflow Limit Registers provide the capability to define an 
underflow buffer. 


The RTX 2000 Family processors utilize ASIC Address OBH for the 
16-bit, write-only register which contains the maximum stack size 
limits for the Parameter and the Return Stacks. On the RTX 2000, 
this register is called the Stack Limit Register, (SLR). On the RTX 
2001A and RTX 2010, it is called the Stack Overflow Limit 
Register, (SVR). 7 


6.1.2.1 Stack Limits For the RTX 2000 


On the RTX 2000, the maximum limit for the Parameter Stack is in 
bits 0-7 of the Stack Limit Register; bits 8-15 contain the maximum 
limit for the Return Stack (see Figure 6.6). These limit values 
determine the number of items which may be pushed onto each 
stack before the Interrupt Controller will generate a "Stack 
Overflow" interrupt signal. The Limit Register for both stacks must 
be initialized on powerup or reset, if stack error interrupts are to be 
used. 
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FIGURE 6.6: RTX 2000 STACK LIMIT 


6.1.2.2 Stack Limits For the RTX 2001A 


The RTX 2001A and RTX 2010 Microcontrollers utilize two 
registers to provide stack limit control. They are the Stack 
Overflow Limit Register, SVR at ASIC address OBH, and the Stack 
Underflow Limit Register, SUR at ASIC address 0AH; SVR is write- 
only register. 


Overflow limits: The overflow limit is the number of items which 
may be pushed onto the stack before an interrupt will be detected. 
Bits 0-5 of the Stack Overflow Limit Register contain the maximum 
limit for the Parameter Stack, and bits 8-13 contain the maximum 
limit for the Return Stack (see Figure 6.7). 


15 14 13,12]11 00,0817 ,6,5,4]3,2,1,0 
LY TY 
PVL: Parameter 
Stack Overflow Limk 


Reserved, should be set 
=Q@ during Write operations 


RVL: Return Stack 
Overflow . Limit 


Reserved, should be set 
= @ during Write operations 


FIGURE 6.7: RTX 2001A STACK OVERFLOW 
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Underflow limits: Bits 3-7 of the Stack Underflow Limit Register 
contain the underflow limit for the Parameter Stack, and bits 11-15 
contain the underflow limit for the Return Stack. See Figure 6.8. 


SUR 


1514) 13;12] 11/10; 98 | 7,65) 4/32) 110) 

~~ © ’ -PSF: Parameter Stack 
Start Flag 

Parameter Substack bi: 


= 0: two 32 word stacks 
= 1: one 64 word stack 


Reserved * 


PSU: Parameter 
Stack Underfiow Limt 
O- 31 words from 
bottom of stack 


RSF: Return Stack 
Start Flag 


Return Substacks bi: 
= 0: two 32 word stacks 
= 1: one 64 word stack 


Reserved * 


- RSU: Return Stack 
Underfiow Lima 
O- 31 words from 
bottom of stack 


FIGURE 6.8: RTX 2001A STACK UNDERFLOW LIMITS 


6.1.2.3 Stack Limits For the RTX 2010 


The RTX 2010 Microcontroller utilizes two registers to provide 


Stack limit control. They are the Stack Overflow Limit Register, 
SVR at ASIC address OBH, and the Stack Underflow Limit 
Register, SUR at ASIC address OAH. 


Overflow limits: The overflow limit is the number of items which 
may be pushed onto the stack before an interrupt will be detected. 
Bits 0-7 of the Stack Overflow Limit Register contain the maximum 
limit for the Parameter Stack, and bits 8-15 contain the maximum 
limit for the Return Stack (see Figure 6.9). 
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PVL: Parameter 
Stack Overflow Limit. 


Number of words from 
top of current substack 
RVL: Return Stack 
Overflow Limit. 
Number of words from 
top of current substack 


“FIGURE 6.9: RTX 2010 STACK OVERFLOW LIMITS 


Underflow limits: Bits 3-7 of the Stack Underflow Limit Register 
contain the underflow limit for the Parameter Stack, PSU, and bits 
11-15 contain the underflow limit for the Return Stack, RSU. See 
Figure 6.10. In addition, this register is utilized to define the use 
of substacks for both stacks (See Section 6.1.3). All Stack 
Underflow Limit Register values must be accessed together. 


SUR 


1514/19)12| 110) 9 8/716) 4|3,2,1,0 


ee, a e/ 
PSF: Parameter Stack 
Start Flag 


Parameter Substack bits: 
00: eight 32 word stacks 
= O01: four 64 word stacks 
= 10: two 128 word stacks 
= 11: one 256 word stack 


PSU: Parameter 
Stack Underfiow Limit 
O- 31 words from 


bottom of substack 


RSF: Return Stack 
Start Flag 


Return Substack bits: 
00: eight 32 word stacks 
01: four 64 word stacks 
10: two 128 word stacks 
11: one 256 word stack 


- RSU: Return Stack 
Underfiow Limit 
O- 31 words from 
bottom of substack 


FIGURE 6.10: RTX 2010 STACK UNDERFLOW LIMITS 
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6.1.3 | Configuration Of Substacks 


The enhanced Stack Controller logic on the RTX 2001A and RTX 
2010 allows the stack related registers to d be used for configuring 
substacks. 


6.1.3.1 Substack Configuration On The RTX 2001A 


Each 64-word stack may be subdivided into two substacks under 
hardware control for simplified management of multiple tasks. Each 
substack is 32 words deep. Stack size is selected by writing to bit 
1 of the Stack Underflow Limit Register for the Parameter Stack, 
and bit 9 for the Return Stack. See Figure 6.7. | 


Substacks are implemented by making bits 5 or 13 of the Stack 
Pointer Register control bits (i.e. they are not incremented when the 
stack size is 32 words). Using this, a particular substack is selected 
by writing a value which contains both the stack pointer value and 
the substack number to the Stack Pointer Register. 


Each stack has a Stack Start Flag which may be used for virtual 
stacks. This is bit 0 of the SUR for the Parameter Stack, and bit 8 
of the SUR for the Return Stack. If the Stack Start Flag is one, the 
stack starts at the bottom of the stack or substack (location 0). If 
the Stack Start Flag is 0, the substack starts in the middle of the 
stack. In a stack 64 elements deep, this is location 32; In a stack 32 
elements deep, this is location 16. An exception to this occurs if 
the overflow limit in the Stack Overflow Limit Register is set for a 
location below the middle of the stack. In this case, the stacks 
always start at the bottom locations. 
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Manipulating the Stack Start Flag provides a mechanism for creating 

en eee a virtual stack in memory which is maintained by interrupt driven 
eo g ~ handlers. 
Possible applications for substacks include use as a recirculating 
buffer (to allow quick access for a series of repeated values such as 
coefficients for polynomial evaluation or a digital filter), or to log 
a continuous stream of data until a triggering event (for analysis of 
data before and after the trigger. without having to store all of the 
incoming data), as in the case of a digital oscilloscope or logic 
analyzer. 


See Table 6.1 for control bit settings for possible stack/substack 
configurations. In Table 6.1, note the following: 


ce SPR is the Stack Pointer Register; SVR is the Stack Overflow 
Limit Register; SUR is the Stack Underflow Limit Register. 

2. PO through P15 are the SPR bits; VO through V15 are the SVR 
bits: UO through U15 are the SUR bits. 

3. The Overflow Limit is the stack memory address at which an 
overflow condition will occur during a stack write operation. 

4. The Underflow limit is the stack memory address below which an 

| underflow condition will occur during a stack read operation. 

=e The Fatal limit is the stack memory address at which a fatal error 
condition will occur during a stack read or write operation. 

6. Stack error conditions remain in effect until a new value is 
written to the SPR. : 

7. Stacks and substacks are circular. After writing to the highest 
location in the stack, the next location to be written to will be 
the lowest location; after reading the lowest location, the 


highest location will be read next. 
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6.1.3.2 | Substack Configuration On The RTX 2010 


Each 256-word stack may be subdivided into up to eight 32-word 
substacks, four 64-word substacks, or two 128-word substacks. 
This is accomplished under hardware control for simplified 
management of multiple tasks. Stack size is selected by writing to 
bits 1 and 2 of the SUR for the Parameter Stack, and bits 9 and 10 
for the Return Stack. | 
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Substacks are implemented by making bits 5-7 of the SPR (for the 
Parameter Stack) and bits 13-15 of the SPR. (for the Return Stack) 
control bits. For example, if there are eight 32-word substacks 
implemented in the Parameter Stack, bits 5-7 of the SPR are not 
incremented, but instead are used as an offset pointer into the 
Parameter Stack to indicate the beginning point (i.e. substack 
number) of each 32-word substack implemented. Because of this, 
a particular substack is selected by writing a value which contains 
both the stack pointer value and the substack number to the SPR. 


Each stack has a Stack Start Flag which may be used for virtual 
stacks. This is bit 0 of the SUR for the Parameter Stack, and bit 8 
of the SUR for the Return Stack. 


If the Stack Start Flag is one, the stack starts at the bottom of the 
stack or substack (location 0). If the Stack Start Flag is 0, the 
substack starts in the middle of the stack. In a stack 256 elements 
deep, this 1s location 128; In a stack 128 elements deep, this is 
location 64; In a stack 64 elements deep, this is location 32; In a 
stack 32 elements deep, this is location 16. | 


An exception to this occurs if the overflow limit in the Stack 
Overflow Limit Register is set for a location below the middle of 
the stack. In this case, the stacks always start at the bottom 
locations. See Tables 6.2 and 6.3 for the possible stack 
configurations. Manipulating the Stack Start Flag provides a 
mechanism for creating a virtual stack in memory which is 
maintained by interrupt driven handlers. 
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Possible applications for substacks include use as a recirculating | 
buffer (to allow quick access for a series of repeated values such as 
coefficients for polynomial evaluation or a digital filter), or to log 
a continuous stream of data until a triggering event (for analysis of . 
data before and after the trigger without having to store all of the . 
incoming data). The latter application could be used in a digital — Cee 
oscilloscope or logic analyzer. . 
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6.1.4 ‘Stack Error Conditions 


Stack errors which may occur on the RTX 2000, RTX 2001A, and RTX 2010 
Microcontrollers are overflow and underflow. 


s ee | . | 
sD An overflow occurs when an attempt is made to push data onto a full stack. 


Boers 


ee “=, Since the stacks wrap around, the result is that existing data on the stack will be 
overwritten by the new data when an overflow occurs. 


ood 


An underflow occurs when an attempt is made to pop data off an empty stack, 
causing invalid data to be read from the stack. 


A buffer zone may be set up using the stack limits to cause a stack error interrupt 
to be generated prior to an actual overflow or underflow occurs. 


6.1.4.1 RTX 2001A and RTX 2010 Fatal Stack Errors 


In addition to the overflow and underflow stack errors, the RTX 
2010 and RTX 2001A provide a fatal error flag. 


A Fatal Stack Error occurs when an attempt is made to push data 
onto or to pop data off of the highest location of the substack. It 
does not generate an interrupt (since the normal stack limits can be 
used to generate the interrupt). The fatal errors for the stacks are 
logically OR’ed together to produce bit 0 of the Interrupt Base 
Control Register, and they are cleared whenever SPR is written to. 
The implication of a fatal error is that data on the stack may have 
been corrupted or that invalid date may have been read from the 
Stack. 
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6.2 Interrupt Controller 


The RTX 2000 Series Interrupt Controller prioritizes 13 interrupt requests, masks 
undesired interrupts, signals the processor core when a valid interrupt has 
occurred, and provides a vector to an interrupt handler to service the interrupt. 


Inputs to the Interrupt Controller come from both internal and external sources. 
Internal sources are the stack overflow and underflow signals, the three 
counter/timers, and the Software Interrupt signal. External sources are the Non- 
Maskable Interrupt'(NMI) input, and the External Interrupt pins EI1-EI5. EI 
_ pins 3, 4, and 5 may be shared with the Counter/Timers for external event 
counting. See Section 6.4 for details. 


Except for NMI, the interrupt inputs may be individually enabled or disabled 
through the Interrupt Mask Register (IMR) at ASIC address 08H. Each bit of the 
IMR corresponds to one interrupt level; Table 6.2 shows the bit associated with 
each level. Setting a bit to 1 disables the corresponding level. Note that the 
Interrupt Disable bit in the Configuration Register must be 0 for any maskable 
interrupts to be recognized by the core.. The NMI input may not be disabled 
through the IMR. 


When the RTX receives an interrupt request, it saves the current contents of the 
Program Counter and Code Page registers in the IPR and I registers, which form 
the logical top element of the Return Stack, then initiates an Interrupt 
Acknowledge (INTA) cycle. During the INTA cycle, the Interrupt Controller 
generates a vector to the appropriate interrupt service routine. The RTX sets the 
Code Page Register to 0, then reads the vector from the Interrupt Controller to 
determine the address of the first instruction to execute for the interrupt service 
routine. 7 


The vector provided by the Interrupt Controller consists of three parts: 


Bits 10-15 Come from bits 10-15 of the Interrupt 
Base/Control Register (IBC). 


© Bits 5-9 Come from the interrupt vector and depend on 
the interrupt level; see Table 6.2. | 


e Bits 0-4 _ Are always 0. 
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The interrupt vector points into a 544 byte table located in Code Page 0 of the 
RTX memory. Each interrupt service routine is allocated 32 bytes in this table. 
If the service routine will not fit in 32 bytes, it may. make calls to any address 
in the RTX’s memory space. The interrupt service routine must include a 
Return-From-Subroutine instruction. 


The interrupt service Table must be located on a 1024-byte address boundary; 
that is, address bits 0-9 must be 0. The IBC register should be initialized with 
_ the upper 6 bits of the address.of the table. For example, if the table is located 
at location 1000H, IBC bits 15-10 should be set to 00010 binary. Table 6.2 
shows the interrupt service routine address associated with each interrupt level. 


The Interrupt Controller samples the interrupt request inputs during each 
instruction at the rising edge of PCLK (except when executing in streamed 
mode). If one or more inputs are active, the Interrupt Controller generates the 
vector corresponding to the input with the highest priority, and signals the core 
processor that an interrupt request is present. The core then initiates an INTA 
cycle. For the timer interrupts, which are edge triggered interrupts, the INTA 
cycle from the processor clears the highest priority timer interrupt and allows the 
Interrupt Controller to process lower priority interrupts. 


The Interrupt Vector Register, IVR, which is a read-only register at ASIC address 
OBH, contains the current vector being generated by the Interrupt Controller. If 
no interrupt request is present, bits 5-9 of the register will contain 10000 binary. 


The IVR vector may be polled for interrupt request information. Note that a 
particular request level must be unmasked in order for the interrupt controller to 
generate a vector for it. 


Conditions may occur in which an interrupt request goes active and then inactive 
prior to the INTA cycle. An example would be a stack operation that overflows 
the stack and a subsequent stack operation that corrects the condition. If the 
interrupt is active long enough, an INTA cycle will be initiated. This results in 
the generation of a "No Interrupt" vector as a valid address and program | 
execution will transfer to the location indicated. Programmers should install a 
service routine for "No Interrupt" to account for this situation. 
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Example: 
The interrupt vector table is located at 2000H. The IBC register should be loaded 


with the binary value 00100xxxxxxxxxx where x0xxx000Kx%x depends on system 
configuration. The Interrupt Controller would generate the following vectors: 


No interrupt - ° 2200H 
NMI -- - 21E0H 
Ell pin +  . 21C0OH 
Timer 0 | -  2100H 
SWI oe 2040H_ 
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TABLE 6.2: ee 


Non- AMéekable vvvv vv01 1110 0000 
ao NMI 

VVVV wl 1100 ‘0000 
paraneter a vvvv vv01 1010 0000 
under f | ow 
Return Stack VVVV wo' 1000 0000 
under flow 
Parameter Stack vvvv vv01 0110 0000 
overflow 


Return Stack revel vvvv vv01 0100 0000 
over f low 


w01 0010 0000 | 


vv01 0000 0000 


ili] 


vv00 1110 0000 
vv00 1100 0000 


ae 


“wv00 1010 0000 


vvvv vv00 1000 0000 


vvvv vv00 0110 0000. 


_ vvvv vv00 0100 0000 
enerr ype 


ww wv10 0000 0000 


Where: 
vvvvvv = bits 10-15 from IBC register 
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6.2.1. Interrupt Acknowledgement 


If interrupts are enabled when the processor receives the Interrupt Controller’s 
signal, it enters an Interrupt Acknowledge (INTA) cycle. During this cycle, the 
processor saves the current execution address on the Return Stack, disables faa 
interrupts as described in Section 3.5.2, then reads a vector from the Interrupt. b= 3) 
Controller which points to the address of a service routine to handle the Bee 
particular interrupt. Section 6.2 describes the Interrupt Controller interface in 


more detail. : 


The INTA cycle sets the least significant bit of the return address saved on the 
Return Stack to a 1, to indicate that the subroutine (Interrupt Service Routine) 
was called as a result of an interrupt. 


When the service routine executes a Return-From-Subroutine instruction to 
resume execution from the point where the processor was interrupted, the set 
LSB of the Return Stack causes interrupts to be enabled automatically. The 
Interrupt Service Routine can also re-enable interrupts, but is then subject to 
being interrupted by another interrupt. 
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6.2.2 Disabling Interrupts 


The processor can enable or disable all maskable interrupts at any time by 
controlling the state of the Set Interrupt Disable bit in the Configuration Register 
(CR bit 4). Setting this bit to 1 disables interrupts; this is the state of the bit 
when the RTX is reset. The processor will not recognize interrupts until the bit 
is reset to 0 by writing to the CR register. 


The CR register contains two bit positions associated with the Interrupt Disable 
bit. | 


The Set Interrupt Disable (SID) bit is a write-only bit which is used to set or 
reset the bit under program control; this bit will always read as 0 no matter what | 
the bit is set to. This provides a convenient mechanism for quickly enabling 
interrupts, whereby the CR register is read onto the Parameter Stack (reading the 
SID bit as a 0), then immediately rewritten, effectively clearing the SID bit to 0 
(enabled). This process requires only two clock cycles, eliminating the extra 
time it would take to read the register, mask the bit, then rewrite the register. 


The Interrupt Disable Status bit (CR bit 14) is a ron) bit which contains the 
true state of the Interrupt Disable bit. 


CR bit 15 indicates the status of the core interrupt request input from the 
interrupt controller. This bit may be polled to determine interrupt status when 
core interrupts are disabled. 
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6.2.3 Software Interrupt 


The RTX has a single level Software Interrupt capability. A special form of one 
of the RTX I/O write instructions sets a flip-flop attached to one input of the 


Interrupt Controller. If the interrupt level associated with the Software Interrupt fe 


is unmasked (See Section 6.2), this input causes the Interrupt Controller to 
generate a vector pointing to the service routine corresponding to the Software 
Interrupt. 


A separate I/O instruction clears the Software Interrupt Request flip-flop. The 
service routine for the Software Interrupt must execute this instruction before re- 
enabling interrupts or returning to normal program execution to prevent another 
SWI cycle from being executed. This interrupt request input is level-sensitive 
~ and will continue to generate interrupts until the flip-flop is reset. See 
"Predefined ASIC Instructions" in Chapter 7 for the machine instructions which 

set and clear the flip-flop. 


Due to internal delays in generating the interrupt request, and the fact that the 
Software Interrupt is assigned to the lowest priority level, the interrupt will not 
be serviced for two instructions following execution of the Software Interrupt 
instruction. This means that the instructions immediately following the Software 
Interrupt should not assume that the interrupt has been serviced. 


Inserting two nops between the Software Interrupt insieuction and the instruction 
which follows it will guarantee that the software interrupt will be serviced before 
the following instruction is executed. 
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On-Chip Hardware Math Support 


- For math intensive applications, the RTX 2000 Microcontroller is 
provided with a 16-bit on-chip hardware multiplier. 


The RTX 2010 is provided with a 16-bit on-chip hardware 
Multiplier/Accumulator, _32-bit Barrel Shifter, Leading Zero 
Detector,.and hardware Floating Point support. 


The RTX 2001A does not have these features. 


6.3.1 RTX 2000 Multiplier Operation 


The hardware multiplier on the RTX 2000 multiplies two 16-bit 
numbers, yielding a 32-bit product, in one clock cycle. The 
multiplier can treat the input operands as either signed (two’s 
complement) or unsigned integers, and can optionally round the 
result to 16 bits. 


The multiplier’s input operands come from the TOP and NEXT 
registers. The multiplication function is activated by a special form 
of the ASIC Bus write instructions to the Multiplier High (MHR) or 
Multiplier Low Register (MLR) address. 


The form of the instruction used determines whether the operands 
will be treated as signed or unsigned values. See Section 7.7.1 for 
the exact instruction coding. Note that the multiply instructions do 
not pop the Parameter Stack; the contents of TOP and NEXT remain 
intact. 


The product is stored in the Multiplier High and Multiplier Low 
Registers. The Multiplier High Register contains the upper 16 bits 


vi the product, while the Multiplier Low Register contains the lower 
16 bits. 


The registers may be read in either order, and there is no 

requirement that both registers be read. Reading either register 

moves its value into the TOP register, and pushes the original value 
_In TOP into NEXT. The original value of NEXT is lost; it is not 
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pushed onto the Parameter Stack. This permits overwriting the 
original operands left in TOP and NEXT, which were not popped me 
the multiply operation. See Figure 6.8. 


If 32-bit precision is not required, the multiplier output may be 
rounded to 16 bits. This is accomplished by setting the ROUND bit 
in the Interrupt Base/Control Register to 1. The ROUND bit 
functions independently of signed or unsigned mode. 


If the ROUND bit is set to one, all operations that use the multiplier 
automatically round the lower 16 bits of the result into the upper 16 
bits. The rounding is achieved by adding 8000H to the least 
significant 16 bits (during the same cycle as the multiply). Thus, 
if the ROUND ove is set, after a ana the result will be as 
follows: 


e If the most significant bit of the MLR is set (=1), the MHR is 
incremented and the MSB of MLR will be 0. 


© If the most significant bit of the MLR is not set (=0), the MLR iS 
left unchanged, and the MSB of the MLR will be 1. 


The multiply instructions disable interrupts during the multiplication 

cycle, and for the next two clock cycles. Reading either result 
register also disables interrupts during the read, and for the next 
clock cycle. This allows a multiplication operation to be performed, 
and both the upper and lower registers to be read sequentially, with 
no danger of an interrupt service routine corrupting the contents of 
the registers between reads. a 
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~ FIGURE 6.8: RTX 2000 MULTIPLIER OPERATION 


Example 1: A typical multiplication sequence without rounding 


1. Set ROUND bit to 0 (if not already set) 

2. Load multiplier and multiplicand into TOP and NEXT 
3. Execute appropriate signed or unsigned "multiply" 
© 4. 
e 5. 


instruction (interrupts are disabled) 
Read lower result register (interrupts are disabled) 
Read upper result register (interrupts are disabled) 


The 32-bit product is now on the Parameter Stack, the most 
Significant 16 bits are in TOP, the least significant 16 bits are in 
NEXT, and interrupts are enabled. 


Example 2: A typical multiplication sequence with rounding 


Set ROUND bit to 1 (if not already set) 

Load multiplier and multiplicand into TOP and NEXT. 
Execute appropriate signed or unsigned "multiply" 
instruction (interrupts are disabled) 

Read upper result register, MHR (interrupts are disabled) 
Exchange TOP and NEXT registers (interrupts are disabled) 
Discard top stack item (interrupts are enabled) 


Sa | ee 


The 16-bit product is-now in the TOP Register and interrupts are 
enabled. 


112 RTX 2000 Family Programmer’s Reference Manual 


6.3.2 RTX 2010 Hardware Math Support 


In addition to an on-chip multiplier, the RTX 2010 provides 
additional hardware on-chip to support Multiply-Accumulate — 
operations, 32-bit shift operations, and Leading Zero Detection. 


6.3.2.1 | RTX 2010 Multiplier/Accumulator Operation 


The Hardware Multiplier/Accumulator (MAC) on the RTX 2010 
functions as both a Multiplier, and as a Multiplier-Accumulator. 


When used as a Multiplier alone, it multiplies two 16-bit numbers, 
yielding a 32-bit product in one clock cycle. 


When used as a Multiplier-Accumulator, it multiplies two 16-bit 
numbers, yielding an intermediate 32-bit product, which is then 
added to the 48-bit Accumulator. This entire process takes place in 
a single clock cycle. 


The MAC’s input operands come from three pews sources (see 
Figure 17): 


e The TOP and NEXT Registers 
e The Parameter Stack and memory 
e The ASIC Bus and memory 


These inputs can be treated as either signed (two’s complement) or 
unsigned integers, depending on the form of the instructions used. 

In addition, if the ROUND option is selected, the Multiplier can — 
round the result to 16 bits. Note that the MAC instructions do not _ 
pop the Parameter Stack; the contents of TOP and NEXT remain 
intact. — 


For the Multiplier, the product is read from the Multiplier High 
Product Register, MHR, which contains the upper 16 bits of the 
product, and the Multiplier Low Product Register, MLR, which 
contains the lower 16 bits. 
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For the Multiplier-Accumulator, the accumulated product is read 
from the Multiplier Extension Register, MXR, which contains the 
upper 16 bits, the MHR, which contains the middle 16 bits, and the 
MLR, which contains the low 16 bits. 


The registers may be read in any order, and there is no requirement 
that all registers be read. Reading from any of the three registers 
moves its value into TOP, and pushes the original value in TOP into 
NEXT. 


If the read is from MHR or MLR, the original value of NEXT is lost, 
i.e. it is not pushed onto stack memory. This permits overwriting 
the original operands left in TOP and NEXT, which are not popped 
by the MAC operations. 


If the read is from MXR, the original value of NEXT is pushed onto 
the stack. 


In addition to this, any of the three MAC registers can be directly 
loaded from TOP. This pops NEXT into TOP and the Parameter 
Stack into NEXT. 


If 32-bit precision is not required, the multiplier output may be 
rounded to 16 bits. The RTX 2010 ROUND mode functions 
exactly like the RTX 2000 ROUND mode. See Section 6.3.2.1 for 
details. 


The multiply instructions suppress interrupts during the 
multiplication cycle. Reading MHR or MLR also suppresses 
interrupts during the read. This allows a multiplication operation to 
be performed, and both the upper and lower registers to be read . 
sequentially, with no danger of a non-NMI interrupt service routine 
corrupting the contents of the registers between reads (for 
compatibility with the RTX 2000). The Multiply-Accumulate — 
instructions do not suppress interrupts during instruction execution. 
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6.3.2.2 RTX 2010 Barrel Shifter and LZD Operation . 


The RTX 2010 has both a 32-bit Barrel Shifter and an 32-bit 
Leading Zero Detector (LZD) for added floating point and DSP 

- performance. The input to the Barrel Shifter and Leading Zero 
Detector is TOP and NEXT. See Figure 6.9. 


_ FIGURE 6.9: RTX 2010 FLOATING POINT/DSP LOGIC 


The Barrel Shifter uses a five bit count stored in the MXR Register 
to determine the number of places to right or left shift the double 
word operand contained in TOP and NEXT. The output of the Barrel 
Shifter is stored in MHR and MLR, with the most significant 16 bits 
in MHR and the least significant 16 bits in MLR. 


The Leading Zero Detector is used to Normalize the double word 
operand contained in the TOP and NEXT Registers. The number of 
leading zeroes in the double word operand are counted, and the 
count stored in MXR. The double word operand is then logically 
shifted left by this count, and the result stored in MHR and MLR. 
Again the upper 16 bits are in MHR, and the lower 16 bits are in 
MLR. This entire operation is done in one clock cycle. 
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6.4 Counter/Timers 


The RTX 2000 Family of microcontrollers contains three identical 16-bit 
Counter/Timers. Each counter may be configured as either an external event 
counter, in which case its clock input comes from an RTX input pin, or as a 
timer, in which its clock input comes from the processor’s internal TCLK signal. 
Each Counter/Timer circuit consists of a pre-load register, a 16-bit down- 
counter, clock selection circuitry, and an interrupt output. See Figure 6.10. 


TIMER / 
COUNTER 


PRELOAD 
REGISTER LOAD INTERRUPT | JINTERRUPT 


TIMER / 
COUNTER | | CONTROLLER} | RESET. 


PRELOAD 
LOAD ERGY ACTIVATE 
REGISTER L 5 ~ wo Ae . TIMEOUT 
TP2 COUNTER INTERRUPT 


FIGURE 6.10: TIMER/COUNTER OPERATION 


6.4.1 Counter/Timer Operation 


Writing to a counter’s ASIC Bus address loads a 16-bit value into its pre-load 
register. This value is loaded into the counter on the counter’s next input clock 
cycle. Each subsequent input clock cycle decrements the counter by 1. The 
counters are free-running in that they do not stop when they reach 0, but rather 
reload from the pre-load register and continue counting. Loading a counter with 
0 is equivalent to loading it with 65536. 
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The counters clock synchronously with the processor’s internal TCLK signal. 
This prevents the clocking from occurring during an I/O read, and means that the 
contents of each counter may be read at any time without disturbing the count or 
interfering with the counting process. This also means that the processor clock 
must be running for counting (from either an internal or external clock) to take 
place, and that the maximum counting rate with an external clock source cannot 
exceed one-half the processor’s clock rate. 


When a counter is written, the value is not loaded until one TCLK or EI pulse 
later, depending on which is the source to the counter. | 


6.4.2 Counter/ Timer Interrupts 


Each counter generates an interrupt signal when it reaches 0. These signals are 
routed through ‘the Interrupt Controller, and may be masked by setting the 
appropriate bits in the Interrupt Mask Register. - 


The counter interrupts are reset during the corresponding Interrupt Acknowledge 
cycle. This means that it is possible that there will be an interrupt request 
present when the interrupt levels associated with each counter are unmasked, 

especially if the counters have been running for some time before being loaded 
with a count value. 
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6.4.3 Clock Selection 


The clock selection circuit determines the source for each counter input clock. 
Each counter may be clocked from either the processor’s internal TCLK signal, 
or from one of the processor’s External Input (ED pins. The 3 EI pins EI3, EI4, 
and EIS are shared with the interrupt controller. Each pin may either be an input 
to the Interrupt Controller, or a clock input to a counter. Bits 8 and 9 in the 
Interrupt Base/Control Register determine the usage of each pin. See Table 6.3. 


TABLE 6.3: TIMER/COUNTER EI PIN ASSIGNMENTS 


Caceres [wore [es [ew [es 


Notes: 
INTn - input to Interrupt Controller, level n 


CLKn - clock input to Counter/Timer n 


If a counter input is not assigned to an EI pin, it is decremented 


by the processor’s TCLK signal. 


If an Interrupt Controller input is not assigned to an EI pin, it 


is held inactive. 
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7 Instruction Set 


This section ieeribes each of the instruction operation codes ("opcodes") 
available.on the RTX processor. Sieg or is thé “assembly language" for the 
processor, the instruction set is described in terms of Forth primitives; 


Appendix A presents the opcodes in Forth format. This chapter presents each 
| of the instructions in terms of their stack and Fegistet effects. 
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7.1 : General asinine | 


Instructions are always aligned on word boundaries, with the most significant 
byte of the instruction at the even address, and the least significant byte at the _ 
next higher odd address. All instructions are 16 bits long, with the exception of 
long literals which require 16 bits for the instruction and 16 bits for the literal 
value. 


All RTX instructions execute in either one or two clock cycles. All instructions 
which do not perform memory accesses execute in a single clock cycle. © 
Instructions which perform memory accesses or load long literal data require two. 
clock cycles. This consistency of execution time makes it possible to write code 
with very predictable behavior. 
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| 2) 7.1.1 Streamed Execution Mode 


The RTX processor has a "streamed" instruction feature, in which an instruction 
is made to repeat a specified number of times by writing a count value into the 
Index Register. This feature is useful for doing fast data transfers, loops and 
some math functions. | 


The count is written into the Index Register using an ASIC Bus write instruction 
to the Index Register at ASIC Address 02H. See Section 4.3.1.3 for details. 
The value written must be 1 less than the desired number of repetitions. 


Only the first cycle of a two cycle instruction is repeated. The second cycle is 
performed only once, after the first cycle has been repeated the desired number 
of times. 


Interrupts are disabled during streamed instruction execution. Only a Non- 
maskable interrupt (NMI) will interrupt streamed execution. | 


The RTX 2010 siGeides the ability to set the NMI. MODE Flag (bit 11 of 
the CR Register). If this bit is set, (MODE1), then the NMI is suppressed 
until the streamed instruction has been completed. 


7.1.2 The Auto-decrementing Loop Instruction 


The RTX provides a fast auto-decrementing loop instruction called next. The 
NEXT instruction branches based on a count previously pushed onto the Return | 
Stack (in I). \ 


The NEXT branch instruction tests the contents of the I Register at the end of each 
loop. If the contents are not 0, the I Register is decremented, and a branch 
(typically to the beginning of the loop) is executed; if the I Register contains 0, 
the Return Stack is popped, and execution continues with the instruction 
following the conditional branch instruction. 
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7.2 Format 


> 
3 
‘eves 
oO 
stats 
oe 
SOX 
ae 
Dp 
a 


All PIOcEssor instructions are 16 bits, with the allowing general fields: 


<* 
Po enenNren 
“SBR RA SS 


2, c se g 
> yyw SISKSS 
YY? eneneeeee 

EN ene srs 


Class. _ General a of instruction: 
~~ .8°9 : > Branches and Loops 
10 : Math/Logic Functions 
11. : - Register and Short Literal Operations 
12. : User Memory Access 
13. : Long Literals | 
14: Memory Access By Word © 
15 : Memory Access By Byte 


Each class is discussed separately: 


ALU : ALU function to be performed. 


SC .-. Subclass. Function depends on Class field. 
R - Return bit. When set, causes a Return-From-Subroutine. 
| Data _- Depending on Class, indicates shift special: short literal data, 


ASIC Bus address, or memory address. 


Opcode descriptions use the format which follows. 
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PRIMARY CLASSIFICATION [Functional Title] 


Functional representation of the instruction 
- bits: {15 @eeseaontcdcegeoenee eves eadaa @eeesgeeas 8 eeseeeanvneeosaceoesaenannecee Seeseaesesee ss 


COPE binary 
Description: | Describes the net effect of the operation. 


Number of cycles: Total number of processor cycles required. 


Processor operations: | 
7 The register and memory operations performed by 

the processor during each cycle of the instruction. 
Notations: N => T Contents of NEXT Register (N), are written to TOP (T) 


—*N => T Contents of NEXT are written to TOP with 1’s 
7 : complement performed if "i" bit in instruction is 1 


N => (T) Contents of NEXT are written to the Memory location | 
addressed by TOP | 


N => Pstack Contents of NEXT are pushed onto the Parameter Stack 


Rstack => PC Return ‘Stack popped into Program Counter (PC) Register 


Parameter Stack effect: Net effect of instruction on Parameter Stack, shown 


before -- after 
Notations: 
| *a | 1’s complement of "a", if "i" bit is set 
a@-op-b the result of ALU operation Hop", performed between "a" 
and "b". The order of the operands is significant for — 
subtraction operations only. 
shiftfa] ' the result of performing the shift operation on value a. 


124 RTX 2000 Family Programmer ’s Reference Manual 


The right-hand item in each list is the top stack element. "T" and "N" are used . 
to represent the contents of TOP and NEXT before the instruction is executed. 
For example, | | | 


SEK RES SER, so 8 
| ee 
NT -—*T rg | — 
‘ : : : - Bese ‘ 
pee 


shows that the instruction starts with palnees in TOP and NEXT, and ends with the fe. 323) 
contents of NEXT being. discarded and the contents of TOP pengdavened,_ be rd , i yonent fly js 


When reading the "Processor eperations® descriptions, it is important to keep in 
mind that the RTX performs the indicated operations in parallel when executing 
an instruction. Thus, the original contents of a register may be used as an 
operand for an instruction even though the register is loaded with a new value ~ 
‘during execution of the instruction. 


For example, the contents of TOP and NEXT may be used as operands for a math 
operation which replaces the contents of TOP with the results of the operation and 
pops the Parameter Stack into NEXT. 


In the descriptions of Processor operations for two cycle instructions, the values 
shown for "T" and "N" during the second cycle of the instruction represent the 
values loaded into TOP and NEXT during the first cycle of the instruction, not the 
contents of TOP and NEXT before the instruction was executed. 

For example, 


Processor operations: : 


ist cycles — N => T m => WN 
— 2nd cycle: N = T T => N 
Parameter Stack effect: 
NT -- Nm 


should be interpreted as described on the following page. 
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ee _ During the first cycle, the contents of NEXT are written to TOP, overwriting the. 
-1 contents of TOP. At the same time, the contents (m) of the memory location 
addressed by the original contents of TOP are loaded into NEXT. 


During the second cycle, the new contents of NEXT (the memory data) are 
written into TOP, while the new contents of TOP (the original contents of NEXT) 
are written back into NEXT. The net effect of this operation is to replace the 
contents of TOP with the contents of the mene location addressed by. the . 
contents of TOP. - 
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7.3 Subroutine Call 


Any instruction which has bit 15 set to 0 will perform a Subroutine Call. The 
contents of the Code Page Register and the address of the instruction following 
the Call are pushed onto the Return Stack. The Program Counter Register is 
then loaded with the address contained in the instruction. 


The address bits in the instruction represent the word address to be executed. 
The actual address may be calculated by shifting the value left by 1 bit, and 
inserting a Q in the least significant bit. For example, an instruction code of 
2A45H would cause a call to location 548AH: 


2A45: 0010 1010 0100 0101 
shift. and insert 0: 0101 0100 1000 1010 = 548AH 


If a Subroutine Call is to be made to a Code page other than the one containing 


the Call instruction, the instruction immediately preceding the Call must load the 
correct page number into the Code Page Register. 


Description: Subroutine: Call. 
Number of cycles: 1 


Processor operations: 


IPR, | => Rstack Save return address on ) Return Stack 
PC >= I1 
CPR => IPR 
aaaaaaaaaaaaaaaN => PC Load Call address into Program Counter 


Parameter Stack effect: 


no change 
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PMO SMS 


roheta See! - 
eee 
a 


pees Ses 


a7, 4 Subroutine Return 


Any non-call/branch instruction which as the Subroutine Return bit (bit 5) Set 
will cause a Return-From-Subroutine operation. The Return Stack is popped into 
the Program Counter Register and Code Page Register, causing execution to _ 
resume with the instruction following the call to the current subroutine. The - 
Subroutine Return bit is we in the oe formats as "R". | 


nies Return-From-Subroutine. 


Number of cycles: _1 if coded as a separate instruction; 0 if coded as 
. part of the last instruction in a subroutine 


‘Processor operations: 


1 => PC IPR => CPR ae => 1, IPR 
Parameter Stack effect: | 
no change 
The Subroutine Return bit may not 3 used i in the following circumstances: 
e UA Branch or Call instruction. All bits of the instruction are significant. | | 
© Any instruction which pops ‘the Return Stack. "Return Stack pop" 


instructions which have the Return bit set Deve as a "Index 
Register Read" instructions. | 


In these situations, a stand-alone return saseraedon must be added as the last 


instruction of the subroutine. This would typically be a No oe (NOP) | 
instruction with the Return bit set. | 
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7.5 Classes 8 and 9: Branches and Loops 


These instructions cause either a conditional or unconditional branch. The RTX 


Branch Instruction treats each Code Memory page as 64 "blocks" of 512 words 


-each. Bits 15-10 of the Program Counter determine the block number; bits 9-0 


determine the word offset within the block. 


In order to perform branches in a single clock cycle, RTX branch instructions 
encode the branch address within the instruction. 


The limited number of bits available for encoding the address requires that all 
branch destinations must be within the same, next, previous, or first memory 
block. Except for the "Branch to block 0" instruction, the longest branch which 
the processor can perform is + 1K words. 


RTX branch instructions have the following general form: 


"ec" - Determine conditions for branching. See Table 7.1. 
"pbaaaaaaaaa" - Branch address. 
"bb" - Block Select. Determines new value of bits 15-10 of 


Program Counter. See Table 7.2. 


“aaaaaaaaa"  - Replaces bits 9-1 of Program Counter (word offset 
from address 0 in the new block). 


Bit 0 of the Program Counter Register is set to 0 (word aligned instructions). 
The resulting branch address is designated "ADR" in the instruction descriptions. 
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TABLE 7.1: _BRANCH CONDITIONS - 
[= [eet eration 
Branch if contents of TOP = 0. Don’t pop stack. ; 
Branch if contents of TOP = 0. Pop stack. 


Unconditional branch 


1.| If contents of Index Register #0, branch and decrement | 


Branch to next memory block (add 1 to value represented 
by bits 15- -10) 


Branch to Block 0 (set bits 15-10 to 0) . 


Branch to previous block (add -1 to value represented by 
bits 15-10) | 


The most important thing to note when calculating the address field for a branch 
instruction is that, when the branch instruction is executed, the Program Counter 
will already be pointing to the instruction following the branch instruction. The 
"bb" field will be . applied to this address, not the address of the branch 
instruction. 


This is only important when the branch instruction is the last instruction in a 512 
word block. In this case, the Program Counter is already pointing to the first 
word in the next block, and the "bb" field must be calculated based on that block 
number, not the block containing the branch instruction. 


Example: A branch instruction is located at address 0O7FEH, the last instruction 
in block #1 (bits 15-10 = 000001). When this instruction executes, the Program 
Counter is pointing to the instruction at O800H, the first instruction in block #2. 

- To perform a branch to a location in block #2, the “bb" field must be set to 00 
(branch to same block) rather than 01 (branch to next block). 
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Branch Address Examples: (bbaaaaaaaa = bits 0 - 9 of opcode) 


Example 1: Branch to same block 
@ address of branch instruction: 0001 0100 1010 0100 
: address to branch to: 0001 0100 1111 0000 
bbaaaaaaaaa : 1001 000 001111000 
resulting address: 
PC-Register bits 15-10 => 0001 01 | 
PC-Register 00 1111 000 
PC-Register bi 


Final branch address 0001 0100 1111 0000 


Paes 2: Branch to next block 7 SO 
address of branch instruction: 0001 0100 1010 0100 
address to branch to: 0001 1000 0101 1110 
bbaaaaaaaaa : 1001 001 000101111 
resulting address: 

PC-Register bits 15-10 => 0001 01 


0001 10 
PC-Register bits 9-1 00 0101 111 
PC-Register bit , 0 


Final branch address 0001 1000 0101 1110 


Example 3: Branch to block 0 
address of branch instruction: 0001 0100 1010 0100 
address to branch to: 0000 0000 1100 1010 
bbaaaaaaaaa : 1001 010 001100101 
resulting address: 

/ PC-Register bits 15- Gis => 0000 00 


PC-Register bits 9-1 00 1100 101 
PC-Register bit 0 


Final branch address => 0000 0000 1100 1010 


"Example 4: Branch to previous block 
address of branch instruction: 0001 0100 1010 0100 
address to branch to: 0001 0000 1101 1110 
bbaaaaaaaaa : — 1001 011 001101111 
resulting address: 
PC-Register bits 15-10 => 0001 01 


: _ -=> 0001 00 
PC-Register bits 9-1 ~ 00 1101 111 
PC-Register bit 0 


Final branch address => 0001 0000 1101 1110 
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BRANCH Unconditional Branch 


Unconditional Branch 


4 + 
4 


Description: Branch to address indicated by bbaaaaaaaaa. 
‘Number of cycles: 1 


Processor operations: 


ADR => PC 


Parameter Stack effect: 


no change 
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BRANCH Branch if T=0, Pop stack 


Branch if T=0, Pop stack 


\ 


Description: 

IfT =0 Performs branch. Pops the Parameter Stack. 
If T # 0 Pops the Parameter Stack. | 

Number of cycles: 1 


Processor operations: 
If T = 0 N => T Pstack => N ADR => PC 


If T + 0 N => T Pstack => N 


Parameter Stack effect: 
If T = 0 NT--N 
If T # 0 | | NT --N 
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BRANCH Branch if T=0, don’t pop stack 


oe ratarates na 
SORES PPR PS S 
DODO ASHES 
¢ ss eee reneratee eats 
Ses 6 revesecesnotenee 
SER rea 0 ose 

rece: SAN ores 
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Branch if T=0, don’t pop stack 


Description: | 

If T = 0 Performs branch. Pops the Parameter Stack. 
If T + 0 No effect. 

Number of cycles: 1 


Processor operations: 


ifT = 0 N => T Pstack => N ADR => PC 


fT #0 © no operation 


Parameter Stack effect: 


IifT = 0 NT--N 
f#T#O | NT --NT 
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NEXT 


Branch if I # 0 


Description: 


Number of cycles: 


This branch instruction is referred to as the "next" 
instruction, and is useful for implementing a fast auto- 
decrementing loop. 


Branch and decrement the Index Register (I), if I is not equal 
to 0. | 


If I contains 0, the Return Stack is popped. Execution 
continues with the next sequential instruction. 


Processor operations: 


If I #0 
Ifl=0 


I1-1 => 1] ADR => PC 


Rstack => | 


Parameter Stack effect: 


no change 
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7.6 Class 10: ALU Operations 


This class of instructions allows the processor to perform arithmetic and logic 
Operations between the contents of the TOP and NEXT registers. These 
operations fall into two general categories: Single step and Multi-step. Multi- 
step Math operations are discussed in Chapter 8. The single step operations 
category covers those functions which may be completed in one clock cycle: 


Addition | _. . Stack manipulations 

Subtraction ‘Boolean logic operations 

1-bit shifting (*2.and /2) 3 | 
All ALU operations are performed between the contents of the TOP Register and 
another operand which is determined by the instruction. The results of the 
operation are loaded into TOP. The ALU function to be performed is encoded 
as a field in the instruction and is shown in the opcode formats as either "ccec" 
or “aaa”. . 4 ag : | 


Table 7.3 lists the ALU functions the RTX can perform. "T" indicates the 
contents of the TOP Register. “Y" indicates the source for the second ALU 
input. For single step math functions, Y is always the NEXT Register. For other 
classes of instructions, the source for Y will vary, depending on the instruction. 


The "Resulting Carry" column indicates the new value which will be latched into 
the processor’s Carry bit as a result of the operation. y 7 


TABLE 7.3: RTX ALU FUNCTIONS 


}ecce | | aaa function Resulting Carry 


0010 001 |} T AND Y YY no change 
i” no change 


ALU carry 
‘ with borrow ALU carry 

no change 
no change . 
ALU carry 
with carry ALU carry 
OR no change 
NOR no change 


y= T) - ALU carry 
- VY with borrow ALU carry 


oR 
NAND 


+ 
+ 


x< 
<< ~~ 


SHH 44 4 
es 
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7.6.1 Carry Bit 


The Carry-out signal from the ALU is bit 0 (CY) of the Configuration Register, ec ss 
CR, and may be used for performing multi-precision addition and subtraction’ Fe = 
operations. The Configuration Register bit may be directly set or read under Feo 

program control. | | 


All addition and subtraction operations set the carry bit, but only the "add with 
carry" (cccc = 1101, see Table 7.3), “subtract with borrow" (cccc = 1101) and 
"swapped subtract with borrow" (cccc = 0101) use the value of the carry bit 
during calculations. None of the Boolean logic functions use or affect the carry. 


Addition operations add the two ALU inputs, then optionally add the Carry-in 
bit (CY) to the least significant bit (LSB) of the sum. The Carry-out bit of the 
ALU becomes the new value for CY; 1 indicates an overflow out of the most 
significant bit (MSB). 


Subtraction operations add the minuend (A in the examples below) to the 1’s 
complement of the subtrahend (B in the examples), then optionally add the Carry- | 
in (borrow) bit to the LSB of the sum. The Carry-out of the ALU indicates the 
borrow status; CY = 0 means that the result of the subtraction was negative and ~ 
that a borrow should be performed from the next most significant stage of the 
subtraction. . | 


TABLE 7.4; Examples; Cout = ALU Carry-out 


A B  Carry-in  — without carry/borrow with carry/borrow — 
: A+B Cout A-B Cout A+B Cout A-B Cout 
0 0 0 0 0 0 1 0 0 -1 0 
0 60 1 0 0} 0 1 1 90 0 1 
0 1 0 1 O -1 0 1 0 -2 0 
0 1 1 1 0 -1 0 2 0 “1 0 
1 0 0 1 0 1 1 1 0 0 1 
1 OO 1 1 0 1 1 2 0 1 1 
1 #1 0 2 0 0 1 2 0 -1 0 
1 1 1 2 .0 0 61 3. 60 0 1 
-1 0 0 -1 #0 “11 -1 «(0 -2 (1 
-1 #0 41 -1 0 -1 1 o 1 -1 1 
-1 1 0 0 1 -2 «1 0 1 = ae 
a 1 0 1 “2.1 1 1 oe 1 


Instruction Set, Chapter 7 | — 137 


ane s nye 
>: sees 
— = 
atone’ 3) 
3509 


I 62 Shift Operations 
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The single step math/logic functions allow the output of the ALU to be shifted © 
as a 16-bit quantity, or the output of the ALU and the contents of the NEXT 
Register to be shifted as a 32-bit quantity in either direction before being loaded 
into the TOP (and NEXT) registers. 
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The shift function is embedded in the instruction, and is shown in the opcode 
formats as "ssss". Each of the shift functions is described in Tables 7.5 and 7.6, 
which use the following notations: 


Zn. Bit n of ALU output (15 - 0) 
TNna Bit n of NEXT Register before shift (15-0) 
CY Old value of Carry bit as a result of ALU operation 
C New value of Carry bit as a result of the shift operation | 
T15,Tn,T®@ MSB, typical bit, and LSB of TOP Register after the shift : 
| operation 
N15,Nn,NO MSB, typical bit, and LSB of NEXT Register after the shift 
| operation 


The first 8 shit functions affect only the TOP Register. The remaining shift 
functions affect either just the NEXT Register, or the TOP and NEXT registers — 
combined as a 32-bit quantity. 


Inthe 32-bit form, the TOP Register represents the most significant word of the 
32-bit quantity and the B NEXT Register the least significant. 
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TABLE 7.5: 16-BIT SHIFT FUNCTIONS 


effect 


no shift operation is performed 


Sign extend: The sign bit (bit 
15) of TOP is propogated to all 
bit positions in TOP. 


Left Shift: TOP is shifted 

left by 1 bit, with 0 shifted 
into the LSB. MSB is shifted 
{nto the carry bit. 


,Rotate Left: TOP is shifted left 
by 1 bit, with the carry bit 
shifted into the LSB. MSB is 
shifted into the carry bit. 


Right Shift Out of Carry: TOP is 
shifted right by 1 bit, with the 
carry bit shifted into the MSB. 
The LSB is discarded and 0 is 
shifted into the carry bit. 


Rotate Right Through Carry: TOP 
is shifted right by 1 bit, with 
the carry bit shifted into the 
MSB. The LSB is shifted into 

the carry bit. 


Logical Right Shift: TOP is 
shifted right by 1 bit, with 0 
shifted into the MSB and carry 
‘bits. The LSB is discarded. 


Arithmetic Right Shift: Bits 
14-1 of TOP are shifted right by 
1 bit. Bit 15 remains unchanged 
and is shifted into the carry 
bit and bit 14. The LSB is 
discarded. 


Left Shift of NEXT: NEXT is 
shifted left by 1 bit, with 0 
shifted into the LSB. TOP and 
the carry bit are unchanged. 


Rotate NEXT Left: NEXT shifts 
left 1 bit, with the carry bit 
shifted into the LSB. TOP and 


' the carry bit are unchanged. 
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_ TABLE 7.6: 32-BIT SHIFT FUNCTIONS 


Status} TOP Resietes 
effect of C | 


32-bit Left Shift: TOP and 214] Zn-11TN15 | TN14] TNn-1 
NEXT are shifted left 1 bit, 

with the MSB of NEXT shifted into 

the LSB of TOP, the MSB of TOP 

shifted into the carry bit, and 

0 shifted into the LSB of NEXT. 


32-bit Rotate Left: TOP and NEXT TN14] TNn- 11 CY | 
are shifted left 1 bit, the MSB_ |. 

of NEXT is shifted into the LSB 

of TOP, the carry bit is shifted 

into the LSB of NEXT, and the 

MSB of TOP is shifted into the 

carry bit. 


32-bit Right Shift Out of Carry: 
TOP and NEXT are shifted right by 
1 bit, the carry bit shifts into 
the MSB of TOP, the LSB of TOP 

is shifted into the MSB of NEXT, 
the LSB of NEXT is discarded, 

and 0 shifts into the carry bit. 


32-bit Rotate Right Through Carry:| 
TOP and NEXT are shifted right by 
1 bit, the carry bit shifts into 
the MSB of TOP, the LSB of TOP is 
shifted into the MSB of NEXT, and 
LSB of NEXT shifts into the carry 


32-bit Logical Right Shift: TOP 
and NEXT are shifted right 1 bit 
with 0 shifted into MSB of TOP 
and the carry bit, the LSB of TOP 
is shifted into the MSB of NEXT, 
and LSB of NEXT is discarded. 


32-bit Arithmetic Right Shift: 
Bits 14-0 of TOP and all of NEXT 
are shifted right 1 bit: Bit 15 
of TOP remains unchanged and is 
shifted into the carry bit and 
bit 14. The LSB of TOP is shifted 
into the MSB of NEXT; the LSB of 


NEXT is discarded. 
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ALU/SHIFT OPERATIONS | Shift T 


Invert/Shift T 


Description: 
Ifi=0 Performs shift operation ssss. Original contents of NEXT are 
- left intact unless affected by shift operation. 
Ifi=1 Inverts TOP, performs shift operation ssss. Original contents 
of NEXT are left intact unless affected by shift operation. 
Note that if both i and ssss are Q, this is a 1-cycle No 
Operation (NOP) instruction. ? 
Number of cycles: 1 


Processor operations: 
Ifi=0 shift{T] => T 


Ifi=1 shift[*T] => T 


Parameter Stack effect: 
Ifi=0 T -- shift[T] 


Ifi=1 T -- shift(*T] 
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| | ALU/SHIFT OPERATIONS —s N= > T, Invert/shift 
| | | i | 


N= >T, Invert/shift 


Description: 

Ifi = 0 Loads contents of TOP with contents of NEXT, then performs 
shift operation ssss. Original contents of NEXT are ° left intact 
unless affected by a shift operation. | 

Ifi=1 Loads contents of TOP with contents of NEXT, cavern the 

: value, then performs shift operation ssss. Original contents 
_ of NEXT are left intact unless affected by a shift operation. 
: Number of cycles: 1 


Processor operations: 
Ifi=0 —— shifttm => T 


Ifi=1 shift(*N]) => T 


Parameter Stack effect: 
Ifi = 0 NT -- N shift iN 


Ifi=1 NT -- N shift (*N] 
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ALU/SHIFT OPERATIONS T-op-N Shift 


T-op-N Shift 


Description: Loads TOP with results of ALU operation cccc and shift 
operation ssss on TOP and NEXT registers. Original contents 
of NEXT are left intact unless affected by shift operation. 


Number of cycles: 1 


Processor operations: 


shift{T-op-N] => T 


Parameter Stack effect: 


NT -- N. shift{[T-op-N] 
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ALU/SHIFT OPERATIONS _Invert/Shift, Pstack= >N 


Invert/Shift, Pstack= >N 


oe « ie 


Description: 
Ifi = 0 Performs shift operation SSss On TOP and original contents 
| Of NEXT. Pops stack into NEXT. 
Ifi=1 Inverts TOP and performs shift operation ssss on TOP and 
Original contents of NEXT. Pops stack into NEXT. 
Number of cycles: ey 


Processor operations: 


Ifi=0 Pstack => N | shift lt] 


a0 
v 
—_{ 


Ifi=l Pstack => NO shift (*T) 


al 
Vv 
~ 


Parameter Stack effect: 
Ifi = 0 NT shifter 


Ifi=1 NT -- shift[*T] 
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-ALU/SHIFT OP =———- N=>T, Invert/Shift, Pstack= >N 


N= >T, Invert/Shift, Pstack= >N eS 
tfopsfopttsytfapolt{efefs|s}s]s| 
Description: 
If i = 0 Moves NEXT into TOP, performing shift operation SSSS. 


Pops stack into NEXT. 
Ifi=1 | Moves NEXT into TOP, inverting it, and performing shift 
) a ssss. Pops stack into NEXT. 


Number of cycles: | 1 


Processor operations: 
Ifi=0 shift(N] => T Pstack => N 


Ifi=1 _— shiftr*n) => T Pstack => N 


Parameter Stack effect: 
Ifi=0 NT -- shiftiny 


Ifi=1 NT -- shiftlen] 
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ALU/SHIFT OPERATIONS __T-op-N, Shift, Pstack= >N 


T-op-N, Shift, Pstack= >N 


1 


Description: Loads TOP with results of ALU operation cccc and 
shift operation ssss on TOP and NEXT registers. 
Pops stack into NEXT. 


Number of cycles: 1 


Processor operations: 


3 


shift [T-op-N] => T Pstack => N 


Parameter Stack effect: 


NT -- shift{T-op-N] 


146 | RTX 2000 Family Programmer's Reference Manual 


ALU/SHIFT OPERATIONS -T=>N, Invert/Shift 


T= >N, Invert/Shift - 


. 


Description: 
Ifi =0 Copies TOP into NEXT, replacing original contents of NEXT. 
Performs shift operation ssss. 
Ifi=1 - Copies TOP into NEXT, inverting TOP and replacing original 
- contents of NEXT. Performs shift operation ssss. 
Number of cycles: 1 


Processor operations: 


Ifi=0 T => N shift(T] => T 
If i 


ll 
jh 


T => N shift(*T] => T 


Parameter Stack effect: 
Ifi = 0 NT -- T shiftlT] 


Ifi=1 NT -- T shift[*T] 
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ALU/SHIFT OPERATIONS = T<=>N, Invert/Shift 


T<=>N, Invert/Shift 


Description: 

Ifi=0 Exchanges the contents of TOP and NEXT, then performs 
_shift operation ssss. 

Ifi=1 _ Exchanges the contents of TOP and NEXT, inverting TOP 
(original contents of NEXT) then performs shift operation 
SSSS. 

Number of cycles; = | 


Processor operations: 
Ifi = 0 T => N _ % shiftIN] => T 


Ifi=1 T => N shift(*N] => T 


fi=0 NT o= T shift tn] 


Ifi=1 NT -- T shiftlen] 
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ALU/SHIFT OPERATIONS —____T-op-N, T=>N, Shift 


T-op-N, T= >N, Shift 


Description: Loads TOP with results of ALU operation cccc and shift 
/ Operation ssss on TOP and NEXT registers. Loads NEXT 
with original contents of TOP. 


Number of cycles: — 1 


Processor operations: 


T => N T-op-N => T 


Parameter Stack effect: 


NT -- T shift [T-op-NJ 
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N= >Pstack, T= >N, Invert/Shift 


- Description: 
Wfi=0 Pushes original contents of NEXT onto stack, copies TOP into © 
| NEXT, and performs shift operation ssss. _ 
Ifi=1 | Pushes original contents of NEXT onto stack, copies TOP into 
| NEXT, inverts TOP, and performs shift operation ssss. 
Number of cycles:  —s_ 1. 
Processor operations: 
Ifi=0 | N => patack T => N shift(T] => T 
Ifi=1 N => Pstack T => N oshift(*T] => 1 


Parameter Stack effect: 
Wi=0 | = 239 wt--WT shifecr 


Ifi=1 NT -- NT shiftt*T 
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ALU/SHIFT OP =§_N=>Pstack, T<=>N, Invert/Shift 


N= > Pstack, T< = >N, Invert/Shift 


Description: 

Ifi = 0 Pushes NEXT onto stack, pushes TOP to NEXT, copies 
original contents of NEXT to TOP, and performs shift 
Operation ssss. 

Ifi=1 . Pushes NEXT onto stack, pushes TOP to NEXT, and copies 
original contents of NEXT to TOP. Inverts TOP (original 
contents of NEXT), and performs shift operation ssss. 

Number of cycles: 1 


Processor operations: 
Ifi = 0 N => Pstack T => N shift(N] => T 


Ifi=1- N => Pstack T => ON. shift(*N] => T 


Parameter Stack effect: 
Ifi=0 NT -- NT shiftiNn] 


Ifi=1— - NT -- NT shiftf*ny 
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ALU/SHIET OPERATIONS N=>Pstack, T-op-N, Shift 


N= >Pstack, T-op-N, Shift 


Description: — Pushes NEXT onto stack, pushes TOP into NEXT, 
: loads TOP with results of ALU operation cccc and 
shift operation ssss on-original contents of TOP and 

NEXT registers. | 


Number of cycles: 1 


Processor operations: 


N => Pstack TS NO  shift{T-op-N] => T 


Parameter Stack effect: 


NT -- NT shift[T-op-N] 


4 
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SISO. 


7.7 Enhanced Processor-Specific Operations 


Each member of the RTX 2000 Family of Microcontrollers has on-chip 
hardware which is specifically designed to support operational requirements 
in the field of applications for which that Microcontroller is intended. 


OS oS 
OS my we myx 
> e. o Bae 
BEES Rs CREAR SRL 
SEE SEOSSELS NAA 
oy eS BINGO 
ORS Regen 
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Utilization of these microprocessor hardware features to achieve enhanced 
performance is possible through use of the product specific instructions for 
each microcontroller. 
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"Unsigned Multiply 


- Description: The Unsigned Multiply operation is initiated. The contents 
of the TOP and NEXT registers are multiplied, with the 32-bit 
result available in the Multiplier output registers MHR, MLR. 
Interrupts are disabled during the execution of this 
instruction. This instruction does not modify the stack. 


Number of cycles: i 


Processor operations: 


T*N => MHR:MLR 


Parameter Stack effect: 


no effect 
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RTX 2000 Specific Instructions co Signed Multiply 


MULS 


Description: |= = The Signed Multiply operation is initiated. The contents of 
the TOP and NEXT registers are multiplied, with the 32-bit 
result available in the Multiplier output registers MHR, MLR. 
Interrupts are disabled during the execution of this 
instruction. This instruction does not modify the stack. 


4 


Number of cycles: 1 


Processor operations: 


T*N => MHR:MLR 


Parameter Stack effect: 


no effect 
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_= | RTX 2000 Specific ‘Read Multiplier High Register 


MHR@ > 


Description: The middle 16 bits of the Multiplier High Register (MHR) 
are pushed onto the Parameter Stack. The contents of TOP 
aré pushed into NEXT, but NEXT is not pushed onto the 
stack; the contents of NEXT are lost. Interrupts are disabled 
during the execution of this instruction. | 


Number of cycles: 1 


Processor operations: 


MHR => T : T => N 


Parameter Stack effect: 


NT -- T MHR 
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RTX 2000 Specific Read Multiplier Low Register 


MLR@ 


Description: The low 16 bits of the Multiplier Low Register (MLR) are 
pushed onto the parameter stack. The contents of TOP are 
pushed into NEXT, but NEXT is not pushed onto the stack; 
the contents of NEXT are lost. Interrupts are disabled during 
the execution of this instruction. 


Number of cycles: 1 


Processor operations: 


MLR => T T => WN 


Parameter Stack effect: 


NT -- T MLR 
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RTX 2001A Specific Instructions _ Increment rx 


Increment RX 


Description: Increments the contents of RX by one. Incrementing the 
contents of the register beyond FFFF Hex results in a wrap 
to 0000 Hex. 

Number of cycles: 1 


Processor operations: 


(RX) -> (RX) + 7 


Parameter Stack effect: 


no change 
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RTX 2001A Specific Instructions Decrement RX 


‘Decrement RX 


Description: Decrements the contents of RX by one. Decrementing the 
contents of the register beyond 0000 Hex results in a wrap 
to FFFF Hex. 

Number of cycles: 1 


Processor operations: 


(RX) -> (RX) - 1 


Parameter Stack effect: 


no change 


Instruction Set, Chapter 7 | | 159 


RTX 2010 Specific Instructions | (= 


Description: 

If TOP = 0 Change TOP to FFFF (implement Forth 0=). 
If TOP + 0 Change TOP to 0000 (implement Forth 0=). 
Number of cycles: 1 


Processor operations: 
IfTOoRP=0O ~sFFFF => 1 


If TOP + 0 0000 => T 


Parameter Stack effect: 
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RTX 2010 Specific Clear MAC Accumulator : 
CLEARACC 
ttettisfefefofopefolejofs| fol 
Description: Clear the MAC Accumulator (MXR, MHR, MLR). 
Number of cycles: 1 


Processor operations: 


0 => MLR 0 => MHR 0 => MXR 


Parameter Stack effect: 


no change 
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= | RTX2010Specific | -—dDouble Shift Left Logical 


| : DSLL | 
BOOUCCOUOGRO Roe 
Description: Double. Shift Left Logical Shift the double word sell 


in TOP and NEXT left logically by the 5-bit count stored in 
the MXR Register. The result is stored in MHR and MLR 


Number of cycles: = 1 
Processor operations: 

“DSLL (T:N) => MHREMLR 
‘Parameter Stack effect: — 


no effect 
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RTX 2010 Specific Double Shift Right Arithmetic 


DSRA | 
ROBO moo 


Description: Double Shift Right Arithmetic Shift the double word 
Operand in TOP and NEXT right arithmetically by the 5-bit 
count stored in the MXR Register. The result is stored in 
MHR and MLR. 


Number of cycles: 1 


Processor operations: 


DSRA (T3N) => MHR:MLR 


Parameter Stack effect: 


no effect 
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RTX 2010 Specific Double Shift Right Logical 


| DSRL 
PPEPP PPP PPP rrr 
Description: Double Shift Right Logical Shift the double word operand 


in TOP and NEXT right logically by the 5-bit count stored in 
the MXR Register. The result is stored in MHR and MLR. 


Number of cycles: 1 
Processor operations: 


DSRL (T:N) => MHRSMLR 


Parameter Stack effect: 


no effect 
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RTX 2010 Specific Store MAC High Register 


‘MHR! 
yoptfspetstsfafsfofetspofataty 


Description: Store the contents of TOP into the MAC Accumulator MHR. 
NEXT is popped into TOP and Pstack is popped into NEXT. 


Number of cycles: 1 


Processor operations: 


T => MHR N => T Pstack => N 


Parameter Stack effect: 
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RTX 2010 Specific. Read Multiplier High Register 


Tete t fs tstsefofofefs poppet 
Description: The middle 16 bits of the MAC register (MHR) are pushed 


onto the Parameter Stack. The contents of TOP are pushed 

- Into NEXT, but NEXT is not pushed onto the stack; the 
contents of NEXT are lost. Interrupts are disabled during the 
execution of this instruction. 


Number of cycles: _ 1 


Processor operations: 


MHR => T T => N 


Parameter Stack effect: 


NT -- T MHR 
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| . . . . 
RTX 2010 Specific Store MAC Low Register | _ 


MLR! 


Description: - Store the contents of TOP into the MAC Accumulator MLR. 
NEXT is popped into TOP and Pstack is popped into NEXT. 


Number of cycles: 1 


Processor operations: 


T => MLR N => T Pstack => N 


Parameter Stack effect: 


MLR -- 
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RTX 2010 Specific Read Multiplier Low Register 


MLR@ > 


’ 


_ Description: The low 16 bits of the MAC register (MLR) are pushed onto 
| the parameter stack. The contents of TOP are pushed into 
NEXT, but NEXT is not pushed onto the stack; the contents 
of NEXT are lost. Interrupts are disabled during the - 
execution of this instruction. | 


Number of cycles: . a 


Processor operations: 


MLR => T T => N 


Parameter Stack effect: 


NT -- T MLR 
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RTX 2010 Specific Mixed Sign Multiply Accumulate — 


MULACM. 
DOR OBR D OO OE 


Description: The Mixed Mode (signed and unsigned) Multiply 
| Accumulate operation is initiated. The contents of the TOP 
and NEXT registers are multiplied (TOP contains the signed 
value and NEXT contains the unsigned value), the 32-bit 
result is added to the 48-bit accumulator (MXR, MHR, MLR). 
This instruction does not modify the stack. 


Number of cycles: 1 


Processor operations: 


(T*N)+MXR:MHR:MLR => MXR:MHR:MLR 


Parameter Stack effect: 


‘no effect 
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RTX 2010 Specific | Signed Multiply Accumulate 


MULACS 


Description: | A Signed Multiply Accumulate operation is initiated. The 
contents of the TOP and NEXT registers are multiplied, the 
32-bit result is added to the 48-bit acumulator (MXR, MHR, 
MLR). This instruction does not modify the stack. 


Number of cycles: 1 


Processor operations: 


(T*N)+MXR:MHR:MLR => MXR:MHR:MLR 


Parameter Stack effect: 


no effect 
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RTX 2010 Specific Unsigned Multiply Accumulate 


| MULACU 
ToT Tsfofefotsfofofet fo] ]s fo 


Description: The Unigned Multiply Accumulate operation is initiated. — 
The contents of the TOP and NEXT registers are multiplied, 
the 32-bit result is added to the 48-bit accumulator (MXR, 
MHR, MLR). This instruction does not modify the stack. 


Number of cycles: 1 


Processor operations: 


(T*N)#+MXR:MHR:MLR => MXR:MHR:MLR 


Parameter Stack effect: 


no effect 
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_ RTX 2010 Specific Mixed Sign Multiply 


MULM 


Description: - The Mixed Sign Multiply operation is initiated. . The 
contents of the TOP and NEXT registers are multiplied, with 

the 32-bit result available in the MAC output registers MHR, 

MLR. The operand in TOP is assumed to be signed and the 

operand in NEXT unsigned. Interrupts are disabled during 

the execution of this instruction. This instruction does not 

modify the stack. : | 


Number of cycles: | 1 


Processor operations: 
T*N => MHR:MLR 
Parameter Stack effect: 


no effect | 
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RTX 2010 Specific Signed Multiply 


Re ee 


Description: | The Signed Multiply operation is initiated. The contents of 
| -. the TOP and NEXT registers are multiplied, with the 32-bit 
result available in the MAC output registers MHR, MLR. 
Interrupts are disabled during the execution of this 

instruction. This instruction does not modify the stack. 


Number of cycles: _ 1 


Processor operations: 


T*N => MHR:MLR 


Parameter Stack effect: 


no effect 
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RTX 2010 Specific. Signed Multiply And Subtract 


Description: — The Signed Multiply and Subtract from Aceimuulator 
operation is initiated. The contents of the TOP and NEXT | 
registers are multiplied, the 32-bit result is subtracted from 

_ the 48-bit accumulator (MXR, MHR,. MLR). This 3 instruction | 
does not modify the stack. 


Number of cycles: 1 


Processor operations: 
| MXR:MHR:MLR-(T*N) => MXR:MHR:MLR 
Parameter Stack effect: 


no effect 
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RTX 2010 Specific Unsigned Multiply 


MULU 
L'teftttpofofotifsfofeltfojrisj oy 


Description: The Unsigned Multiply operation is initiated. The contents 
of the TOP and NEXT registers are multiplied, with the 32-bit 
result available in the MAC output registers MHR, MLR. 
Interrupts are disabled during the execution of this 
instruction. This instruction does not modify the stack. 


Number of cycles: | 1 


Processor operations: 


_T*N => MHR:MLR 


Parameter Stack effect: 


no effect 
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RIX 2010 Specific — 


Description: 


176 


Read MAC Extension Register | 


MXR@ 


Pushes the contents of the 16-bit extension register of the 
- MAC output onto the parameter stack.. 


Number of cycles: | 1. 


Processor operations: — 


MXR => T T = N N => Pstack 


~ Parameter Stack effect: 


-- MXR 
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RTX 2010 Specific Store MAC Extension Register 


SHR 
Pat 


See 
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Description: Stores the contents of TOP into the MAC Accumulator MXR. 
| NEXT is popped into TOP and the Parameter Stack is popped 
into NEXT. 


Number of cycles: | 


Processor operations: 


T => MXR. N => T Pstack => N 


Parameter Stack effect: 


MXR -- 
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RTX 2010 Specific => Normalize 


NORM 


Description: __ The Normalize operation counts the number of leading zeros 
: in the double word operand in TOP and NEXT. This count 
is stored in MXR. TOP and NEXT are also shifted left 
logically by this count to eliminate all leading zeros. The 
shifted result is in MHR and MLR. , 


Number of cycles: 1 


Processor operations: 


NORM(T:N) => MHR:MLR Count => MXR 


Parameter Stack effect: 


no effect 
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RTX 2010 Specific Shift MAC Register Right 


RSACC 


Description: This operation shifts the MAC output registers right (MXR - 
> MHR, MHR -> MLR, sign fills MXR, contents of MLR 
are lost. ) This is useful in implementing double precision 
multiply operations. 


Number of cycles: 1 


Processor operations: 


MXR => MHR ' MHR => MLR signfill => MXR 


Parameter Stack effect: 


no effect 
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RTX 2010 Specific ~ Streamed MAC - ASIC/Memory 


BO I 
ERR 
oomoncneneteneiet 


~ SMACA | 


Description: © Streamed MAC between ASIC Bus and Memory is an 
| instruction which indicates to the processor that the next 
instruction is a streamed instruction that will initiate a 
streamed MAC between the ASIC bus and memory. See 
Section 7.7.1 for detailed information. : 


Number of cycles: 1 


Processor operations: 


N/A 


Parameter Stack effect: 


no effect 
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RTX 2010 Specific Streamed MAC - Stack/Memory 


SMACS 


Description: Streamed MAC between Stack and Memory is an instruction 
_. Which indicates to the processor that the next instruction is 
a Streamed instruction that will initiate a streamed MAC 
between the stack and memory. See Section 7.7.1 for 
detailed information. 


Number of cycles: 4 
Processor operations: 

N/A 
Parameter Stack effect: 


no effect 
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7.7.1 Streamed MAC Instructions On The RTX 2010 


One of the features of the RTX-2010 is the ability to perform a high speed 
(one clock cycle per iteration) multiplication and accumulation of two 
streams of data. One stream is a list of data in external memory, and the 
other is either a list of data on the parameter stack or a stream of values 
input from the ASIC bus. 


smaca is the instruction to initiate a streamed MAC with the ASIC bus and 
SMACS initiates a streamed MAC with list on the Parameter Stack. In general, 
the code to implement the algorithm is as follows: 


mem_addr count-1 SMACS OFC "DMA" 
mem_addr g-addr SR! count-1 . SMACA OFC "DMA" 


where: 
men_addr -—sésiS: the address of first data item in memory list 
count-1 is the number of items in list 
OFC is the instruction to implement streaming 
uDMA" is instruction to read sequential items from memory. 
g-addr is the ASIC Bus address to be streamed : 


The "smaca" and "smacs" instructions are used to set the source of one input 
data stream, and are part of a special opcode sequence which is used 
exclusively in the RTX-2010. These commands initiate a processor state 
which affects the operation of the instructions that follow it. The. 
instructions for streamed MAC must occur in the sequence previously 
described. smaca and smAcs suppress interrupts so that the oF¢ instruction is. 
guaranteed to follow without interruptions. 


The "pma" opcode is an existing RTX instruction that normally reads data 
from memory into NEXT, over writing the data in NEXT. During each 
cycle, the contents of the memory location addressed by TOP are read into 
the NEXT Register. Concurrently, the contents of TOP are incremented by 
the value of the five bit literal field of Gen instruction, to generate the next 
memory address to be read. 
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When used following the smaca or smacs instructions, the operation of the 
"pMA" opcode is modified. In the special processor state that is initiated by 
SMACA Or SMACS, the pMA instructions cause the contents of NEXT to be 
multiplied by the contents of a pipeline register from the stack or ASIC Bus, 
and added to the 48-bit accumulator. In the same clock cycle, the oma 
instruction also performs its normal operation, reading the next data item 
from memory into NEXT and auto-incrementing the address in TOP. 


The following is an example of a streamed multiply/accumulate operation 
between a list of data in memory and another list of data on the stack. The 
stack list is assumed to be in another stack area, necessitating the saving and 
restoring of a stack pointer. 


\ ( mem_addr count-1 stack_addr -- answer ) 
SPR@ MD! SPR! \ save & set stack pointer 


SMACS \ set streamed mac instruction execution 
OF ¢ . \ set stream count 
DMA \ execute streamed MAC DMA | : 
DROP DROP \ eliminate address and last data from stack 
“MD@ SPR! \ restore the stack pointer | 


MLR@ MHRA MXR@ \ fetch 48-bit accumulated value 


The next example performs the same operation, with the exception of the 
second argument being a stream of input from the ASIC Bus data. The smaca 
instruction requires the desired ASIC address to be stored in the SR Register. 


\ ( mem_addr count-1 asic_addr -- answer) 
SR! \ set ASIC bus address 
SMACA \ set streamed mac instruction execution 
OF ¢ \ set stream count 
DMA \ execute streamed MAC DMA 
DROP DROP \ eliminate address and last data from stack 


MLR@ MHRA MXRA \ fetch accumulated value 


The opcode for DMA is E842H. 
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. |) 7.8 Class 11-a: ASIC Bus Access © 


This class of instructions manipulates the contents of devices attached to the 
ASIC Bus. This includes the RTX internal registers (Index Register, 
Configuration Register, Multi-step/Multiply Divide, Square Root, and Program 


S : . #4 Counter), On-Chip Peripheral devices (Stack Controllers, Interrupt Controller, 


Multiplier, Counters), and external I/O devices such as UARTs or SCSI 
controllers. 


The processor is able to directly access 32 ASIC Bus devices/registers. The 
specific ASIC Bus address is encoded as a 5-bit field in the instruction (indicated 
by " gggge" in the instruction formats). See Section 7.8.1 for more information 
in instructins eS use this instruction format. 


_ ASIC Addresses 0 - 17H are used internally by the RTX processor for registers. 
Chapter 4 describes the register address assignments. Some of these addresses 
perform special functions when referenced with different forms of the ASIC Bus 
instructions. Section 7.8.2 describes these special instruction forms. 

ASIC Addresses 18 - 31H are provided for access to —_ ASIC devices. 
7.8.1 ASIC Bus Instructions 


Instructions which access ASIC Bus locations have the specific location encoded | 


as a 5-bit field in the instruction. This field is indicated by "ggggg" in the 


Instruction formats. The 5-bit field enables the processor to directly access 32 
ASIC Bus devices/registers. 


Some of the ASIC Bus instructions perform ALU operations on the data 


accessed. These operations are indicated by “cccc" in the instruction formats and 
are the same as those described in the “Single-step Math Functions" class. 
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ASIC Access _G-read, DROP, Invert 


| G-read, DROP, Invert 


a 


Description: © 

Ifi=0 Reads and discards data from ASIC address ggggg. Useful 
for performing "“dataless" I/O accesses, in which an I/O 
device needs to be addressed, but no data transfer needs to 
take place. See Section’ 7.7.1)for limitations on the use of 
this opcode. ace ee ee eee 

Ifi=1 Reads and discards data from ASIC address ggggg, inverts 
TOP. See remainder of description above. 

Number of cycles: 1 


Processor operations: 
If I — 0 Jom tT 


Ifi=1 *T => T 


Parameter Stack effect: 
Ifi=0 NT -- NT 


Ifi=1 NT -- N*T 
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|| ASIC Access N=>Pstack, T=>N, (ggggg)=>T, Invert 


N= > Pstack, T= >N, (gege)= >T, Invert 


Description: 
Ifi=0 Pushes NEXT onto stack, TOP into NEXT, then reads data 
| from address ' BEEBE into TOP. 
Ifi = 1 _ Pushes NEXT onto stack, TOP into NEXT, then reads data. 
from address ggggg into TOP, inverting data. 
Number of cycles: 1 


Processor operations: 


at 
Vv 
— 


Ifi=0 — N => Pstack T= N-  — (ggggg) 


ot 
Vv 
=f 


Ifi=1 - N &> Pstack T => N *(ggg99) 


Parameter Stack effect: 


Ifi=o  NT--NTd 


Ifi=1 NT NT td 


186 7 | _ RTX 2000 Family Programmer's Reference Manual _ 


ASIC Access 


N= >Pstack, T= >N, T-op-(ggggs) 


N= >Pstack, T=>N, T-op-(ggggg) 


Description: 


_Number of cycles: 


Processor operations: 


N => Pstack 


Parameter Stack effect: 


Pushes NEXT onto stack, TOP into NEXT, then reads 
data from ASIC address ggggg and loads TOP with 
results of ALU operation cccc on original contents 
of TOP and data. . 


T => N T-op-(ggggg) => T 


NT -- NT T-op-d 
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| ASIC Access = 8 T= > (gggegg), Invert 


T=>  (eeeee), Invert 


Description: | 

Ifi = 0 Writes contents of TOP to ASIC address ggggg. Origi 

| contents of NEXT are left intact. See Section 
limitations on the use of this opcode. 

Ifi=1 - Writes contents of TOP to ASIC address gegeg. Inverts 
original contents of TOP. Original contents of NEXT are left _ 
intact. 

Number of cycles: §—_ 1 


Processor operations: 
If i= 0 T => (ggg99) T => T 


Ifi=1 | | T => (ggggg) | *T => T 
Parameter Stack effect: 
Ifi = 0 NT -ONT) 


Wi=1 NT WaT 
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ASIC Access T=>(ggggg), N=>T, Pstack= >N, Invert 


T= > (ggggg), N= >T, Pstack= >N, Invert 


Description: | 

If i = 0 Writes contents of TOP to ASIC address geggg. Copies 
NEXT into TOP. Pops stack into NEXT. 

Ifi=1 Writes contents of TOP to ASIC address ggggg. Copies 
NEXT into TOP, inverting value. Pops stack into NEXT. 

Number of cycles: 1 


Processor operations: 
Ifi=0 T => (ggggg) N => T Pstack => N 


Ifi=1 T => (ggggg) “No => T Pstack => N- 


Parameter Stack effect: 
Ifi=0 | | NT --N 


Ifi=1 . NT -- *N 
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ses Sis Sse | _ | 
=) ASIC Access  (ggggg)-op-T 


(ggggg)-op-T 


Description: Reads data from ASIC address ggggg, and loads 
| TOP with results of ALU operation cccc on contents 
of TOP and data. Original contents of NEXT are left 
unchanged. | 


Number of cycles: 1 


Processor operations: 


(ggggg)-op-T => T 


Parameter Stack effect: 


NT -- N d-op-T 
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47.8.2. Predefined ASIC Bus Instructions 


Some RTX ASIC Bus opcodes are predefined to perform specific functions. 
These include "Select Data Page Register", "Select Code Page Register", fe: 
"Software Interrupt", and "Remove Software Interrupt". Descriptions of these ae | 
opcodes follow. — | | -_ coe 


The Multiply, MAC, and. Barrel Shifter are also controlled using predefined 
ASIC instructions. See Section 7.7 for descriptions. 
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Description: Sets DPRSEL Bit to 1, causing all data memory accesses to 
be addressed through the Data Page Register. 
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Memory Page Access | Select Code Page Register 


Select Code Page Register 


Description: Sets DPRSEL Bit to 0, causing all data memory ACCESSES to 
be addressed through the Code Page Beutel: 
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Interrupts _ _ ~ Software Interrupt Request 


~ Set SOFTINT 


Description: — Sets the Software Interrupt Request flip-flop, generating a 
Level 13 interrupt to the processor. Due to the time 
required by the processor internally to generate and process 
the interrupt signal, this instruction should be followed by 
two 1-cycle instructions which do not depend on whether or | 
not the interrupt has been serviced (NOPs, for example). 
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Interrupts | Remove Software ant © 


Clear SOFTINT 


Description: ———_—&Resets Software Interrupt Request flip-flop. The interrupt 
| service routine for the Software Interrupt (level 13) must 
execute this instruction before re-enabling interrrupts or 

executing a subroutine return. 
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| 7.9 ~~ Class 1b - Short Literals 

ea | This class of instructions generates short literals (positive values 0 to 31). | 
oe The value of the literal is embedded in the: instruction and is shown as "ddddd" | 
|=] in the instruction formats. The value represented by "ddddd" is loaded into bits 


ee _ 0-4 of the TOP register; bits 5-15 are set to 0. If the value is inverted by having 
the "i" bit set in the instruction, all 16 bits of the value are inverted. 


| Some of these instructions perform ALU operations using the literal data. These 
-_ operations are indicated by “cccc" in the instruction formats and correspond to — 
Table 7.3. 
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Short Literals 


d Invert 


d Invert 


Description: 
Ifi = 0 
Ifi=1 


Number of cycles: 


Pushes NEXT onto stack, copies TOP into NEXT, loads the 
value ddddd into TOP. : 


Pushes NEXT onto stack, copies TOP into NEXT, loads the 


value ddddd into TOP, and inverts the value. 


. Processor operations: 


1 
N => Pstack . ' JT s> N ddddd => T 
N => Pstack T => N *ddddd => T 


Parameter Stack effect: 


NT -- NT ddddd 


N T -- NT *ddddd 
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N= > Pstack, T= >N, T-op-d 


Description: Pushes NEXT onto stack, copies TOP into NEXT, 
then loads TOP with result of ALU operation cccc 
between contents of TOP and value ddddd. 


Number of cycles: 1 


Processor operations: 


N => Pstack T => NW | T-op-ddddd => T 


Parameter Stack effect: 


N T -- N T T-op-ddddd 
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Short Literals | d=>T, Invert 


d=>T, Invert 


Description: 
Ifi=0 Loads the value ddddd into TOP. Original contents of NEXT 
| are left unchanged. 
Ifi=1 Loads the value ddddd into TOP, a 16 bits of the 
value. Original contents of NEXT are left unchanged. 
Number of cycles: 1 


Processor operations: 
Ifi=0 ddddd => T 


Ifi=1 *ddddd => T 


Parameter Stack effect: 
STfi=0 NT -- N. ddddd 


Ifi=1 NTN *ddddd 
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|= Short Literals | | c-op-T 


d-op-T 


Description: | Loads TOP with results of ALU operation CCCC. 
7 | between value ddddd and contents of TOP. Original 
contents of NEXT are left unchanged. 


Number of cycles: ar | 


Processor operations: 


ddddd-op-T => T 


Parameter Stack effect: 


NT -- N ddddd-op-T 
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7.10 Class 12: User Memory Access 


This class of instructions performs reads and writes to the User Memory Space. 
The User Page Register and User Base Register determine the address of the 32- 
word user memory block. | 


The address to be accessed within the User Space is encoded as a 5-bit field in 
the instruction, and is indicated by “uuuuu" in the descriptions. Note that 
“uuuuu" represents the word address of the location to be referenced. For 
example, uuuuu = 3 will perform a read or write to word #3 (byte #6) in the 
User Space. All User Memory Space accesses read or write a 16-bit value. 


The data written to or read from the User location is indicated in the descriptions 
as "(u)". | i 


Some of these instructions perform ALU operations using the memory data. 
These operations are indicated by “cccc" in the instruction formats and 
correspond to Table 7.3. 
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1 User Memory Access ON = > Pstack, (u)= >N, Invert 


N= > Pstack, (u)= >N, Invert 


Description: 
Ifi = 0 Pushes NEXT onto stack, then reads data from user location 
uuuuu into NEXT. 
Ifi=1 - Pushes NEXT ania stack, then reads data from u user location 
| uuuuu into NEXT. Inverts TOP. 
Number of cycles: 2 


Processor operations: 


Ifi = 0 ist cycle:  -N => Pstack (u) => N 
end cycle: NOP | 

yfi=i1 4st cycle: N => Pstack | | (u) => N 

end cycle: T => ¥*T | 


Parameter Stack Effect: 
Ifi = 0 NTN DT. 


Ifi=1 | NT -- N (u) *T) 
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User Memory Access (u)=>T, Invert 


(u)= > T, Invert 


Description: | ava ’ M7 

Ifi = 0 Reads data from user location uuuuu fA *TOP. Original 
contents of, NEXT are left unchanged. 

Ifi=1 Reads data from user location uuuuu ifito TOP, inverting 


data. Original contents of NEXT are left unchanged. 
: | 
( 


Number of cycles: 2 


Processor operations: 


Ifi=0 ist cycle: (u) => N 
end cycle: T => N 
Ifi=1 - Ist cycle: (u) => N 
2nd cycle: T => NW 


Parameter Stack effect: 
Ifi=-o0 YT e-W-tu—> / 7. — | ) 
Ifi=1 NeTan—-N“* CUS A T etic A) ms | 9,9 ( lA, 
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User Memory Access 


a Ne >Pstack, T= >N, T-op-(u)=>T ; 
ROOD EEEo OOOO OO 
Pushes NEXT onto oe TOP into. NEXT. then loads 


Description: Ss 
‘TOP with result of ALU operation cccc: on contents 
of TOP and data read from user location uuuuu. 


Number of cycles: 2 
‘Processor operations: 
ist cycle: N => Pstack (u) => ND 
2nd cycle: T => N N-op-T => T 


Parameter Stack effect: 
NT -*e NT T-op-(u) 


RTX 2000 Family Programmer’s Reference Manual 


204 


oS 


OK fa 
XRKKSY 


User Memory Access T=>uuuuu, Invert 


SRSA? 


OOOO SOSO 


T=> uuuuu, Invert 


Se 


Description: 
Ifi=0 . _ Writes the contents of TOP to user location uuuuu. Original 
| | contents of NEXT are left unchanged. 
Ifi=1 | Writes the contents of TOP to user location uuuuu. Inverts 
contents of TOP (after write operation). Original contents of 
NEXT are left unchanged. : 
Number of cycles: 2 


Processor operations: 


[iWfi=0 4st cycle: T => wuuuuu 
2nd cycle: T => T 


Ifi=1 ist cycle: T => wuuuu 
| * 2ndcycle: *T => T 


Parameter Stack effect: 
fi=0 | _ NT--NT 


Ifi=1 ONT NT 
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User Memory Access 


T= >uuuuu, N= >T, Pstack= >N, Invert 


’ 


Description: 

Ifi=0 Writes contents of TOP to user location uuuuu. Moves 
NEXT into TOP. Pops stack into NEXT. | 

Ifi=1 Writes contents of TOP to user location uuuuu. Moves 
NEXT into TOP, inverting the value. Pops stack into NEXT. 

Number of cycles: 2 


Processor operations: 


Ifi=0 | 1st cycle: T => uuu 
énd cycle: N => T Pstack => N 
Ifi=1 4st cycle: T => uuu 


2nd cycle: *N => T Pstack => N 


Parameter Stack effect: 
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User Memory Access T-op-(u)= >T 


T-op-(u)= >T 
bb Pelefefelefef of te fefefefe 


Description: - Loads TOP with results of ALU operation cccc.on contents _ 
of TOP and data read from user location uuuuu. Original 
contents of NEXT are left unchanged. 


Number of cycles: 2 


Processor operations: 
Ist cycle: | - N => Pstack (u) => ON 


2nd cycle: T-op-N => T Pstack => N 


Parameter Stack effect: 


NT -- N T-op-(u) 
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_ = | 7.11 Class 13: Long Literals 


This. class of instructions generates 16-bit literal values. The 16-bit value is 
contained in the memory location following the Long Literal instruction. The . 
value contained in this location is identified in the descriptions by “D". Long. 

Literal instructions are the only RTX instructions which occupy two 0 memory 


BE RRs 
eieeeea locations. 
0.@ Sf 


Some of these instructions perform ALU sputio using the literal data. These 
Operations are indicated by “cccc" and correspond to Table 7.3. 
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Long Literals : N= > Pstack, D= >N, Invert 


N= > Pstack, D= >N, Invert 


Description: 

Ifi=0 Pushes NEXT onto stack, loads literal value into NEXT. 

Ifi=1 Pushes NEXT onto stack, loads literal value into NEXT, 
inverts TOP. 

Number of cycles: 2 


Processor operations: 


Ifi=0 ist cycle: N => Pstack D => N 
énd cycle: NOP 
Ifi=1 Ist cycle: N => Pstack D => N 


énd cycle: *T => T 


Parameter Stack effect: 
Ifi=0 NT --NDT 


Ifi=1 NT --ND *T 
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Long Literals N= > Pstack, T= >N, D= >T, Invert 


N=>Pstack, T=>N, D=>T, Invert 


Description: 

Ifi=0 Pushes NEXT onto stack, TOP into NEXT, then loads literal. 
value into TOP. 

Ifi=1 Pushes NEXT onto stack, TOP into NEXT, then loads literal 
value into TOP, inverting the value. 

Number of cycles: 2 


Processor operations: 


Ifi = 0 ist cycle: N => Pstack : D => N 
2nd cycle: TT = N oN = T 

Ifi=1 ist cycle: N => Pstack — D => N 
| 2nd cycle: T => N | *“N => T 


Parameter Stack effect: | 
Ifi= 0 NT--NTD 


Ifi=1 - NT -- NT *D 
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Long Literals N= >Pstack, T= >N, T-op-D=>T |. 


o 


N= > Pstack, T= >N, T-op-D= >T 


Description: . Pushes NEXT onto stack, TOP into NEXT, then loads 
TOP with results of ALU operation cccc between 
Original contents of TOP and literal. | 


Number of cycles: 2 


Processor operations: 
ist cycle: W => Pstack D => N 


end cycle: T => N T-op-N => TD 


Parameter Stack effect: 


NT -- NT T-op-D 
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=. Long Literals oo -—-: D=>T, Invert 


D= > T, Invert 


Deseri ption: | 


fi = 0 3 “Loads literal value into TOP. Original contents of NEXT are 
is | | left unchanged... | 


Wfi-= 1 ok Loats literal value into TOP, inveiting the value. ee 
" | ~. contents of NEXT are left St oe te | 


Number of cycles: a 


Processor operations: 


Wis 0 4st cycle: = -N => Pstack op = N 
2nd cycle: N= T° is Pstack => N 


Wfi=1 4st cycle: N= Pstack 0 => N 
| 2nd cycle: Nos TO — Pstack => N 
- Parameter Stack effect: 

Wi=O0 wt-wo. 


Wi=2o02 22 wt uD 
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Long Literals 


overs 


Description: Loads TOP with result of ALU operation cccc between literal 
value D and contents of TOP. Original contents of NEXT are 
left unchanged. 


Number of cycles: 2 


Processor operations: 
ist cycle: N => Pstack : D => N 


2nd cycle: T-op-N => T Pstack => N 


Parameter Stack effect: 
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7. 12 Classes 14 and 15: Data Memory Access 


eS These two classes of instruction perform reads and writes to Data Memory 
eee) Space. The DPRSEL bit controls which page address register selects the 
ey accra page. 


| 2) - The address within the page of the location to be accessed is contained in the 
TOP register. For memory reads, the data is moved from memory into the NEXT 
register. For memory writes, the data moves from NEXT into memory. 


The instruction formats are identical for both word and byte access. The "s" bit 
ve 12) of the instruction dictates the size of the operand (s = 0 for 16-bit word, 
= 1 for 8-bit byte). | 


For byte writes to memory, the contents of bits 0-7 of NEXT are written to the 
memory location addressed by TOP. For byte reads, the memory data is read 
into bits 0-7 of NEXT; bits 8-15 of NEXT. are set to 0. 


The data read from or written to memory is identified j in the descriptions as "m". 
Short literals are is ientfied a8 "Ty d" The memory location ene e) ‘ 
\ contents of TOP is ane as "Ty. ‘A 


Some of the i instructions may perform ALU operations on the cee These 
operations are identified by either “ccccor "aaa" in the instruction format, and 
correspond to the values in Table 7.3. 


“(T) 
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Data Memory Access = #$N=>T,m=>N, Invert 


N=>T, m= ON, Invert 


Description: 
Ifi=o _ Moves NEXT into TOP. Loads memory data contained i in the 
location addressed by TOP into NEXT. 
Ifi=1- | Moves NEXT into TOP, inverting value. Loads memory data 
| contained in the location addressed by TOP into NEXT. 
Number of cycles: 2 


/ 


Processor operations: 


Ifi = 0 1st cycle: N => T | : m => N. 
2nd cycle: T = T | 

Ifi=1 Ist cycles WN => T | m => N 
2nd cycle:  § *T => T a 


- Parameter Stack effect: 
Ifi=0 | NT g m N 


ifi=1 NT --m*N 
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Data Memory Access | | m=>T, Invert 


m= >T, Invert 


Description: 

Ifi = 0 Loads memory data into TOP. Original contents of NEXT 
are left unchanged. 

Ifi=1 | Loads memory data into TOP, inverting data. Original 
contents of NEXT are left unchanged. 

Number of cycles: 2 


Processor operations: 


Ifi=0 ist cycle: N => T m => N 
end cycle: N => T T => N 
Ifi=1 ist cycle: _ N => T 4 m.=> N 
end cycle: *N => T T => N 


Parameter Stack effect: 


Ifi=0 NT--No 


Ifi=1 | ae 
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Data Memory Access _wopN= =>T 
N-of -r 
an-op-N= 
ttt ftfsfefefefefofofelx]x]x[x]x 
Description: | Loads TOP. with ails of ALU operation cccc 


between memory data and contents of NEXT. 
Original contents of NEXT are left unchanged. 


Number of cycles: 2 


Processor operations: 
_ Ist cycle: N => T m => N 


2nd cycle: T => N T-op-N => T 


Parameter Stack effect: 


oo yen 


a A N T ~~ 
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Data Memory Access ; {N= > Pstack}, m= >N 


| {N= > Pstack}, m=>N_ 


Description: 
If p = 0 Loads memory data into NEXT. Original contents of TOP 
are left unchanged. _ | 
If p = 1 =~—.:~ Pushes NEXT onto stack. Loads memory data into NEXT. 
_— Original contents of TOP are left unchanged. 
Number of cycles: 2 


Processor operations: 


If p= 0 ist cycle: m => N- 

end cycle: =. no operation 
if p =] Ist cycle: N => Pstack m => N 
: | 2nd cycle: no operation | 


Parameter Stack effect: 
Ifp=0 NT-mT 
Ifp=1 NT-NmT 
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Data Memory Access {N= > Pstack}, m= >N, d= >T 


{N= > Pstack}, m= >N, d= >T 


Description: 

Ifp = 0 Loads memory data into NEXT, and short literal value ddddd 
into TOP. 

Ifp=1 Pushes NEXT onto stack, loads memory data into NEXT, and 
short literal value ddddd into TOP. 

Number of cycles: 2 


Processor operations: 


Ifp = 0 ist cycle: m => N ddddd => T 
2nd cycle: no operation 

Ifp =1 1st cycle: N => Pstack =m => N  ddddd => T 
2nd cycle: no operation : 


Parameter Stack effect: 
If p = 0 3 NT -- m ddddd 


Ifp=1 NT -- Nm ddddd 
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| Data Memory Access 


{N= > Pstack}, m= >N, T-op-d= >T | 


Description: 

Ifp=0 Loads memory data into NEXT. Loads TOP with results of 
ALU operation aaa between the contents of TOP and short 
literal ddddd. | 7 

Ifp=1 Pushes NEXT onto stack. Loads memory data into NEXT. 

| Loads TOP with results of ALU operation aaa between the 

contents of TOP and short literal ddddd. 

Number of cycles: 2 


Processor operations: 


If p= 0 ist cycle: m => N T-op-ddddd => T 
2nd cycle: no operation 

Ifp=1 ist cycle: N => Pstack m => N T-op-ddddd => 1-1 
2nd cycle: no operation — 


Parameter Stack effect: 


Ifp=0 


Ifp=1 
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Data Memory Access 


N= >(T), N=>T, Invert, Pstack= >N 


x» 
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Description: 
Ifi = 0 . Writes data in NEXT to location addressed by TOP. Copies 
NEXT into TOP, pops stack into NEXT. 
Ifi=1 | Writes data in NEXT to location addressed by TOP. Copies 
| 7 NEXT into TOP, inverting contents, then pops stack into 
NEXT. 7 
Number of cycles: Z 


Processor operations: 


Ifi=0 ist cycle: N => (T) N => T Pstack 


=> WN 
and cycle: T => T 

Ifi=1 1st cycle: N => (T) N => T Pstack => N 
énd cycle: *T => T 


Parameter Stack effect: 
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ce So  acieataac Access N=>(1), Pstack=>N,T, Invert 


N= >(F Pstack= >N, T, Invert. 


Description: | 
Ifi= 0 . Writes contents of. NEXT to location addressed 1 by TOP. | 
| | Pops new values into TOP and NEXT. a | 
If i =1 "Writes contents of NEXT. to. ocation addressed by TOP. | 
fo - Pops new values into TOP and NEXT. Inverts n new contents 
(Of TOP. | 
‘Number of cycles: wa ADS 


- Processor operations: | 


Ifi=0 © 1st cycte: - we) NN» T Pstack => N 
tg © 2nd cycle: N => T — Pstack => N aoa 
Ifi=1. istcycle: oN => (1)  ~N-=> T _ Pstack => ON 


2nd cycle: *N => T  Pstack => N 
Parameter Stack effect: 
fi=0 | SNT--S 


Wfi=1 | _ | St es 
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N-or-r" 
Data Memory Access  -op-N=>T, Pstack= >N 


Nope 
Jwop-N= >T, Pstack= >N 


_ Description: Loads TOP with results of ALU operation cccc between 
memory data and contents of N. Pops stack into N. 


Number of cycles: 2 


Processor operations: 
1st cycle: — N => T ("om => N 


2nd cycle: T-op-N => T Pstack => N 


Parameter Stack effect: 
N T - m-op-N-- -0 f om ae 
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Data Memory Access N= >(TI), {Pstack= >N} 


N=>(T), {Pstack=>N} 


Description: 
If p= 0 Writes contents of NEXT to memory location addressed by 
| TOP. Original contents of TOP (address) are left unchanged. 
Ifp=1 Writes contents of NEXT to memory location addressed by 
TOP. Original contents of TOP (address) are left unchanged. 
Stack is popped into NEXT. 
_ Number of cycles: 2 


Processor operations: 


Ifp = 0 ist cycle: N => (T) 
end cycle: no operation 

Ifp=1 1st cycle: N => (T) Pstack => N 
2nd cycle: no operation 


Parameter Stack effect: 


If p= 0 NTo-NT 


Ifp=1 | NT --T 
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Data Memory Access N=>(T), d=>T, {Pstack=>N} 
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N=>(T), d=>T, {Pstack=>N} — 
ES 
Description: 
Ifp=0 Writes contents of NEXT to location addressed by. TOP. 


Loads short literal ddddd into TOP. 
Ifp=1 Writes contents of NEXT to location addressed by TOP. 
Loads short literal ddddd into TOP. Pops stack into NEXT. 


Number of cycles: Z 


Processor operations: 


Ifp=0 ist cycle: N => (T) ddddd => T 
| 2nd cycle: no operation 
Ifp =1 1st cycle: N => (T) Pstack => N ddddd => T 
2nd cycle: no operation 


Parameter Stack effect: 
If p= 0 NT -- N ddddd 


Ifp=1 NT -- ddddd 
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| Data Memory Access \ N= >(I), &op-F= >T 
N=>(T), aaa >T | 

Description: 

If p=0 _ Writes contents of NEXT to location addressed by TOP. 
Loads TOP with results of ALU operation aaa between short 
literal ddddd and contents of TOP. 

Ifp=1 Writes contents of NEXT to location addressed by TOP. 
Loads TOP with results of ALU operation aaa between short 
literal ddddd and contents of TOP. Pops stack into NEXT. 

Number of cycles: | y 


Processor operations: 


Ifp = 0 1st cycle: N => (T) ddddd-op-T => T 
énd cycle: no operation — 

Ifp =1 1st cycle: N => (T) Pstack b. ddddd-op-T => T 
2nd cycle: no operation 


Parameter Stack effects: 


_ap-ddddd 


If p= 0 NT --N gdddd=op:T A) 


Ifp=1 NT == dddddeop-t </ 


226 | RTX 2000 Family Programmer's Reference Manual 


7.13 Undefined opcodes 


The following bit patterns are reserved for future use and should not be used for 
opcodes: 


User Space: 


Long Literal: 
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8 | : Step Math Functions 


The Harris RTX 2000 Series Microcontrollers all include a unique and powerful 
set of instructions known as Step Math Instructions. These instructions allow 
the RTX microcontrollers to perform certain math operations. much more quickly | 
- than would be eee without them. , 


81 Introduction 


Step math operations include signed and unsi igned multiplication, unsigned 
division, integer square root, bit reversal and cyclic redundancy checks. They 


also expand: the RTX Processons, ability to pertonn logical rotation operations. 


In order to achieve this increase in efficiency, the processor operates differently 

than when performing ordinary math. To explain this in simplified terms, 
‘intuitive mnemonics will be used here because of the number of operations that 

“can happen in a single cycle. Forth descriptions are used only where doing so_ | 
Clarifies the operation. In general, it is best to consider step math operations as 

| Forth primitives. | 


8.1.1 Step Math Using The RTX 2000. 


All of the step math functioris listed dove can be sealed on 
‘the RTX 2000 Microcontroller. However, because of the 
hardware multiplier which is incorporated on-chip. with this 
product, a special set of single cycle instructions is used to 
perform multiplication in the place of step math operations. See 
Section 6.3.1 for more detailed information about the one 
hardware multiplier. 7 
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8.12 Step Math Using The F RTX 2001 


On the RTX 2001A, step math spetations are used to _—— all | 
of the functions listed i in the introduction. 


B13 “Step Math Using The RTX 2010 


All af the step math éifiotionis listed can be Sertonned on the | 
RTX 2010 Microcontroller. However, because this product. 
provides the hardware Multiplier/Accumulator, Barrel Shifter, and ~ 
other Floating Point Support on-chip, special instructions are used 
to’ perform some math operations in place of step math 
operations. See Section 6.3.2 for more detailed information about 
the on-chip hardware math support for the RTX 2010. ‘3 
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8.2 Data Flow in Step Math 


Figure 8.1 shows the data flow diagram for all step math operations. Even 
though the hardware to perform step math is always present, much of it is 
inactive when not performing step math and therefore it is not aes outside 
of discussions on step math. , : 


Note that the: ALU is followed by a shifter. This allows an ALU operation and 
a shift to be performed in a single = without passing the data through the 
ALU twice. 


Step math operations a6 use two special purpose registers (MD and SR) and one > 
pseudo register (SQ) in their operations. There are also dedicated shift blocks 
and logical OR blocks used with the MD and SR registers so that data in them 
does not have to pass through the ALU to be updated. 


The result of this architecture is that the equivalent of five ALU operations can 


be performed in a single cycle, and the cycles eau to mansiet data for these — 


ALU operations are eliminated also. 
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~~ FIGURE 8.1: RTX STEP MATH DATA FLOW DIAGRAM __ 
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$3 17-Bit Math 


_ Many of the step math operations treat the TOP register as a 17-bit wide register 
to accomplish their tasks. Correspondingly, the ALU is extended to 17 bits for | 
these operations by the 17th-bit adder. Since the 17-bit. result is sometimes 
shifted left one bit, an 18th bit is also needed to store the shifted bit. The 17th. 
and 18th bits are held in bits zero and one of the Configuration Register (CR), 
a consequently shane only at the end of a ook on the rising edge.of PCLK. 


These bits are sometimes referred to in the data. sheers: as the carry (CY) ‘i , 
complex carry (CCY) bits, respectively, but i in the context of step math, this may 

_ be misleading nomenclature. In this case, these bits are more accurately thought 
of as an extension of the TOP eBIIE: and will be referred to here only as CRO 
and CR1. : 


There are values referred to as CY and CCY in other sections in this manual, 
which under certain conditions are clocked into CRO and CR1 at the end of a 
~ cycle, though these values are not necessarily true carry bits. When this is the 
case, CRO and CRI may contain the pau CCY Tesult of the previous math ort 


Bi 
t 
. re 


a _ Note that CRO only changes when performing an ALU or shift operation, and 
CRI only changes during step math operations that include an arithmetic ALU 
, . operation. Also note that CRO and CR1 are sometimes referred to as CQ and 
co aPeevey in other sections of this manual. 


Ls 
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| 8.4 ‘The Step Math Instruction Format 


ordinary math operations. The op code for Step | math instructions is divided into 
| groups of micro opcodes, each of which has a specific effect on the instruction. 
| ‘The general format for BED math instructions is shown in. Figure 8. a | 


4 Signed/Unsigned 
Micro Opcode 
Field — 


Peery rem ee 
Micro Opcode Field 


eondtetonal Behavior 
- Micro Opcode Field 


Reglater Selection 
Micro Opcode Field 


ALU Micro opcode Field 


~ FIGURE 8.2: STEP MATH INSTRUCTION FORMAT. 


As is evident from the step math instruction format, there are ten bits, not 

including the Retum bit, that determine the step math operation, which implies. 

.. that there are 1024 possible step math operations. While this is true, not all of 
7 these ¢ Operations are useful. 
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- All a math apeasoas: chave the same fotmat which is ‘similar to that of 


to "REG" indicate the MD or the SQ register as determined by ae 
as described in Section 8.4.2 and shown in Table 8.2. | 


8.4. 1 ALU Micro Opcode Field (aaa) 


| Step math operations usually include a conditional ALU operation Sees the 


TOP register and another register, either the MD or the SQ pseudo register. Step 


- math operations may also include an ALU No- “Operation or a conditional ALU 


load from es MD or SQ register. 


TABLE 8.1: STEP MATH ALU FUNCTIONS 


[Lee | runceton 


No ALU operation 
=> TOP and REG 
=> TOP - REG 


YES 
| YES 
YES 
_ YES 
YES 
YES 
YES 


TOP or REG 
TOP + REG 
TOP xor REG 
REG - TOP 
REG 


Whether the conditional ALU operation occurs depends on a pseudo variable, 
called "YES". If YES is true (1), the conditional. ALU operation. will be 
performed; if YES is false (0), the contents of TOP will be preserved, though in 
either case the contents of TOP may be shifted by an unconditional shift 


Lc The procedure for determining XPS will be explained oy 


The 18-bit result of the conditional ALU operation, as determined by YES, results 


in a value called "W," as shown in Figure 8.1. This value is then shifted to 


determine the value shifted into CR1, CRO,.TOP.and NEXT. If the ALU - | 


operation is not arithmetic (i.e. + or -), 'W17 and Wwi6, are ae value stored i inCR1 
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— To i 3 


we 
~ and CRO a aa - er hi € CG Pk Ie ho 
ca ak a wit : WRAY © RA WaN fe 
Conditional ALU operations aueae see math are summarized 7 . References 
"r" bit ee 8), \ 


Ot 


8.4.2 Register Selection Micro Opcode Field (r) | 


The register selection micro ‘opcode field (bit 8) determines whether the input to 
the ALU operation will be the MD or the SQ register.. If r = 0, the input will be 
the MD Renee if r= L, the input will be: the oe ss an oe 


2 The SQ register is — a pseudo 1 epee ‘there is ‘not a “unique register. 
associated with it. When reading the SQ register, the value obtained is the 
contents of the MD register shifted left one — then logically OR’ed with the — | | 
contents of the SR register. — deve ¢ uy Hed 7, a ee 

" Also, the most significant bit of MD (bit 15) is fed into the 17th-bit ander As 
we Shall see, this allows the RTX processors to calculate the square root of an 
- integer without using Newton’s method. Also, writing data to the SQ register 

(ASIC Bus address 5) has the effect of multiplying the data by 256 and placing © 
it into the MD register. Applications for this procedure is useful include 
calculation of some oot feauoaee checks. 


‘The! ny" bit has another function in step math. If r = 1, ie. the so register is 


selected, the data in MD and SR will be modified at the end of the cycle. The sere 
_ data in MD will canditienally be replaced with MD logically OR’ed with the data 7 ae 
in SR, and the data i in SR wilt be shifted right one dit. — a Lf ore ; | \, 
/~ The condition th that determines Whether this happens i is ; the s same > condition. that * whi, ae 
determines whether an ALU operation will be performed (YES). This behavior ye va ie a ‘ 
is useful in 1 both square t root and bit reversal = | : a, agnt 1 
Ye : Ht | iA nant tog) 
The behavior of MD and SR’as determined by "r is summarized i in Table 8. a. ae fu es 
7 , | i ae 
TABLE 82: MD AND SR OPERATION a 
| ONG pn 
\ +- \ C 
| dt ee 
- TOP (shift) |] | oo te ain | 
TOP (alu op) MD (shift) WSS 
| TOP (shift) | 2" 3 Ag 7) 
TOP (alu op) SQ (shift) ©. a .. td ne - 09 
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8.4 3 Conditional Behavior Micro Pata Field (yy) 


The conditional behavior micro opcode field (yy, bits 6 and 7) determines the 
value of the pseudo variable “YES. " 


In general, YES is determined by the logical combination of one or more bits i in 


_. registers or by the carry, out bit from the 17-bit ALU operation. The bits involved Be 
are the carry out bit C36), bits zero and three of the opcode (Instruction Register 


bits IRO and IR3), CRT bit zero of the TOP a (TO), and bit zero of the 
NEXT Register (NQ). 


Rea a, 
15 amy 
Anas Caen ame 
iameadteatintaatetenin 


Another factor that may affect the result is whether the ALU eon | ; is 
arithmetic or ee The behavior of YES is summarized in Table 8.3. 


| IF ARITHMETIC THEN Cores ELSE IRO : 
(IF ARITHMETIC THEN = 6 ELSE IRO) or cRL 
IF IR3 = O THEN TO ELSE NO 
TO xor NO 


So far, little has been said about the carry out bits, C15 and C16. These are 
outputs of the 16- bit ALU and the 17th-bit adder. 


7 “When, addition i iS s performed, the carry bits are set if the result of the addition i 1S 
too large to place in the available number of bits. When subtraction is performed, 
~ however, the carry bit represents an inverted borrow bit. In this. case, the amy 
bit i is cleared if the result of the subtraction is negative. | 


In step math, the carry bits are used primarily in operations that involve a 
conditional subtraction, namely division and square roots. In these cases, the 
carry bit is set if the subtraction was successful, which causes the result of the 
subtraction to replace the original value in TOP. Both of these cases also shift — 
the result left one bit, which is why an extra bit (CR1) is required, since it always 


_. contains the 17th bit of the result of the most recent arithmetic dy math 


operation. 
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ry 4.4 Subroutine Return Micro Opcode Field (R) 


The operation of the subroutine retum micro 0 opcode, field cit 5)i iS 5 exactly the _ 
same as for oiner: RTX instructions. te 


If this bit equals one, a subroutine return is executed along with the instruction, 
= it equals zero, the next sequential instruction is executed. | 
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8. 4 5 ‘Unconditional Shift Micro Opcode F Field 1) 


Every step math operation may include an unconditional shift operation, ‘(its Ae. 
2, and 3) which is performed on W. Since the shift is performed in a separate 
section from the ALU, the shift may occur in the same oe as the ALU 
enon | 


Shift operations. occurring dunifig step math differ from oe occurring during 
_ ordinary math operations. In particular, the sources of carry 1 in bits to the shifts 
_ ‘may be emercnt, and saa come from YES and CRI. es | 


Shifts may aperic on TOP, NEXT or both. The various shift operations are 
summarized with Forth mnemonics in Table 8.4. 


“TABLE 8.4: STEP MATH SHIFT OPERATIONS | 


. Note that. bits WO through W17 are the result oF the conditional 
ALU yepera ton as determined by YES. et ad 8 


There are. seven important exceptions t to Table 8.4. First, the value clocked into | 
CRI for opcode A057 (hex) is W17 instead of W16 as indicated by the table. 
_. -The second special case occurs when the shift operation is cD2/’ and the "S" bit 7 
_ (bit zero .of_the opcode) is 1, as_it_is in the case_of_si gned_multiply_step 
at instructions. For t these~ instructions, the value of the bit shifted into T15 may - 
ee fitier: from W16. hte | 7 | 
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194 6 Signed/Unsigned Micro Opcode Field (S) 


The signed/unsigned micro. opcode field (bit 0) has a double purpose. FirstMwit 
determines whether right shifts and additions are treated as signed or unsigned | 
| during multiply steps; and second, it is used with yy = 01 to allow manual control © 
of conditional logical ALU operations. . This allows unconditional 17-bit shifts, 
7 forexample. When used with arithmetic ALU. the “S" bit affects the 
sae jul to the 17th-bit adder. | | | | 


ae If S= 0 (unsigned), the adder eperaiesk on CRO and a zero bit. This is 
-.. because the zero is treated as the 17th bit of the MD register. Since MD 
is considered to be unsigned, its 17th bit i is mie azero. tit. 


AES = | (signed), ‘the adder will operate on CRO and MD15. In me 


“case, the sign bit of MD is extended into its 17th bit. If the operation is — 


subtraction, one of the inputs to the adder will be inverted, as will be — 
discussed below. The "S" bit also affects right shifts during renee 2 
qaniep aes: to determine the value shifted into TOP. 2 a 
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‘ 8. 5 Operation of the 17th-Bit Adder 


-. The 17th-bit adder is an extension of the ALU for addition and mibirsction 
- Operations. It can be used for subtraction as well as addition because its inputs 
may be inverted depending on the operation. | 7 


When performing addition, the carry out indicates that the oe is negative, or 
has overflowed, as is the case with the ALU. | | 


When serfomiine subtraction, the carry indicates an 1 inverted borrow, i 1.e. a carry | 
out indicates that the subtraction of the 17th bit did not require a borrow, and that 
the result is non-negative. This i is also ae Same as a carry out during Subtraction 7 
for as ALU. » 7 | 
_. The operation of the 17th-bit adder i is summarized in Table 8. 3. 


TABLE 8.5: 17th-BIT ADDER OPERATION 


0 

0 
0 
1. 
6) Z 
1 

1 
ml 


: Step Math Functions, Chapter 8 | a a 7 | | | 237 ; 


| 8.6 ~ Interrupting Step Math Operations | 


In general. step math operations may be interrupted as long as the interrupt 
handlers are well behaved; i.e. they save and restore any shared resources they 
may use.. This is. in agreement with good interrupt handler design for any 
Processor: _ ic. .3 | _ | | 


Pe The shared resources ‘that affect step math operations are CR bits 0 and 1, MD 
and SR: Any handler that affects these should make sure they are in the same 

State upon returning from the anESTUP that they were in when. the interrupt 
occurred. 


One exception is the signed step multiply operation. If it is interrupted, there is | 
a probability that the. Lely will be. incorrect, though this ones on the values 7 


Fg Sy gers 


SOIC e-duzatio 
“interrupted, ‘these: values’ will be’ 1 SPN : 
“signed “multiplication isnot *satficient=to. prevent apt as this does not 7 
Piven the last step wale is not sieatned) from. being icity aca 
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| 8.7 8 Some Useful Opcodes 


It may help to look at some specific step math operations to get a feel for how 
they are used before considering specific cases. Table 8.6 provides a list of 
useful step math opcodes with their Forth mnemonics and a brief description. 
_The one sections describe these operations in more bee : 


TABLE 8.6: SOME USEFUL STEP MATH OPCODES | 


OPCODE FORTH DESCRIPTION . 


17 Bit left shift . 

Rotate TOP:NEXT right 

Rotate TOP right — 

‘Signed multiply steps 1-15 
Signed multiply step 16 
Unsigned multiply steps 1-16 
Mixed sign multiply step 16 - 2 
Byte unsigned multiply steps 1-8 
Byte mixed sign multiply step. 8 
Unsigned divide step -1 

Unsigned divide steps 2- -15 
Un'signed divide step 16 : 
Alternate unsigned divide step 16. 
Byte unsigned divide steps ls 8 
Square root step 1 

Square root steps 2-15 

Square root step 16- 

Byte Square root step 1 

Byte square root steps a 8. 

Bit reversal step 

CRC step 
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| 8. 8 . Step Multiplication 
In applications i in which multiplication is needed, the RTX 2001A Microcontroller ° 
uses the step multiplication operations which are described in the following © 
sections. These step math multiplication operations would not normally be 


| performed on the RTX 2000 Microcontroller due to mn increased speed available 
a mye its onschip hardware multiplier. ice | 


8.8.1 ‘Signed Step Multiplication 


‘The primitive signed step ‘multiplication operation operates on two si igned | 

_ numbers. One of these numbers, the multiplier, is initially in NEXT, and the 

other, the multiplicand, is in the MD register. me product 1 is a ee re 
piecsion pumber on the stack. | 


Prior to performing si ‘ened Step multiplication, both CRO and cRi should be 
_ initialized to zeros and TOP. should be initialized to zero.. TOP may optionally _ 
_ be initialized to a signed number which will be added to the Product. 


If step si igned multiplication is interrupted, the result may be incorrect. | 
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8.8.2 Signed Step Multiplication Op Codes" 


a. 


_- Signed Multiplication Steps 1 Through 15 


~A89D = 1010 1000 1001 1101 
.aaar = 1000 © = 
Vy = 10 = 
‘sssS = 1101 = 
If NO = 


TOP + MD --> TOP — 
TOP:NEXT / 2 --> TOP:NEXT 


; ce : . ‘ 
Signed Multiplication Step 1 


_ A49D = 1010 0100 1001 1101. 
aaar = . 01000 = 
yy. = 10 = 
sssS =. 1101 = 
If NO = 


_ TOP - MD --> TOP 
TOP :NEXT / 2 --> TOP :NEXT 


> ALU OP = TOP + MD --> TOP. 
: YES: = NO 

shift = Bean! signed - 
> ALU OP = TOP - MD -=-> TOP 
> YES = NO | 
> shift = 


S cD2/" signed 


This step differs from *’ in that MD is conditionally subtracted from TOP 
Anstead of added. This is because the value originally in NEXT is in 
two’s complement form and as such the most si igmificant bit a a 
: negative multiple of a power of 2, bes 243, 


“883, Signed Step Maltilicati Example Program 


HEX _ 


:M* (non aed } ae 


CR@ DUP 2* 0<'10 AND OR >R i 


CR@ 10 OR cR! 
MD! , 

0 

O+ 

Qe 

a 2 a 2 2 krook? *? 
A Sr or or oe a ea 2 


R> CR! ; 
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2A AA ASS 


Save State “BE int disable bit, . 
Disable interrupts 


.Set up MD. 


Set up TOP 


‘Clear CRO 


Clear CR1 


Perform the multiplication 


Restore tne-areapia Bie 


m1 


8.8.4 Mixed Sign Multiplication Type A 


Type A mixed sign multiplication is similar to signed multiplication except that . 
_ the value originally i in NEXT is treated as a 16-bit unsigned integer. Because of _ 
this, the last step is the same as the _ 15 steps, which are the same as for | 
signed multiplication. | 


~ This type of multiplication is useful for calling the partial Product of fa 
| aun precision multiplication. 


| Because all the multiplication steps are the same, the operation c can be micaned . 
| to disable interrupts, which reduces the overhead for this type of multiplication. 


8. 8.5— Type A Mixed Sign Multiplication Example Program: 


: MA* (un-—-d) 


MD! \ set up MD | 

0. \ Set up TOP 

0 + _\ Clear CRO. 

ane _ _ \ Clear CR1 

F-OF( *! ¢ | \ Perform the multiplication 
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8.8. 6. Unsigned Multiplication 


me Unsigned multiplication is similar to “signed multiplication except that ‘the 
multiplier and multiplicand are both treated as unsigned 16-bit values. Also, 8-bit 
unsigned multiplication is supported. This allows a faster pulplceion in the | 
“event the multiplier. and multiplicand. are both 8-bit values. | 


Since the raultiplicand (MD) is aunts positive in unsigned multiplication, CRO 
is added to a zero bit instead of the most significant bit of MD in the the 17th-bit 
adder. Otherwise unsigned multiplication is similar to signed multiplication. 


| 8.8.7 ; Unsigned Multiplication Op Codes 


ut? 

Unsigned Multiplication steps 1 Through 16 

A89C 1010 1000 seer 1100 | . 
TOP + MD ==> TOP 


YY = 102~*«* YES = ND 
-If N@ = 


TOP + MD --> TOP 
TOP:NEXT / 2 --> TOP:NEXT 


Bute act . | 
8-Bit Uiieloned multiplication ese through 8 


A894. = 1010 1000 1001 0100 . —. _ | 
aaar = — 1000 ==> ALU OP = TOP + MD --> TOP 
YY, 10 ==> YES =T | ae 
sssS = ~.. 0100 nae shift. = c2/’ unsigned 
If T = 1 
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8.8.8 | Unsigned Multiplication Example Program: 


: UM* (u u -- du 


MD! , os \ Set up MD 
0 , \ Set up TOP 
* OF. \ Cléar CRO 
gee \ Clear CR1 
Ur?! Uk! Ue! UF Yee Uk! Yee we \ 


Perform the multiplication 
U*? Ur? U*" U*! Ux*? Ux? UK U*? : . : 


8.8.9 8-Bit Unsigned Multiplication Example Program: 


: BUM* ( bu bu -- u ) 


SQ! : | \ Set up multiplicand 

O + | : \ Clear CRO. : 

24°" 2/ | | ar \ Clear CR1 | : 

8u*’ 8U*"’ 8U*! 8U*! - 3 \ Perform the multiplication 


sux’ sux” Bux" BUN ; 
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8. 8. 10 Mixed Sign Multiplication Type B. 


- Type B mixed Si ign multiplication is similar to unsi leads multiplication except that 
the value originally in NEXT is treated as a 16-bit signed integer. Because of. 
this, the last step is different (i.e. MD is subtracted from instead of added to TOP) © 
from the first 15 steps, which are the same as for unsigned multiplication. This — 
is for the Same reason that a different step iS required in Si igned multiplication. 


This. type of multiplication iS useful for calculating the partial product of a 
Pe ae pee ulpycanen, and is also er for 8- ‘bit ab operanes: a 


8. 8. 1 Mixed Sign Multiplication Type B Op Codes 


‘Ue 
Unsigned Multiplication step 16 


-A49C = 1010 0100 1001 1100 an: ; _ 
aaar = 01000070 ==> ALU OP = TOP ~ MD -~-> TOP ~ 
YY = 10 ==> YES = N@ . 
sssS = 1100 ==> shift. = cD2/’ unsigned 
IfNp = 1 


TOP - MD --> TOP 
<< TOP:NEXT / 2 --> TOP:NEXT 


: Sow | Oo 
-  B-Bit ‘Ghwignea suteipiigation Step 8 . 


A494 = 1010 0100 1001 0100 . . . 
aaar = 0100 ==>. ALU OP = TOP - MD --> TOP . 
yY = 10 : ==> YES = TO = — a 
sssS = (0100 ==> shift = c2/’ unsigned 
‘If£TS = 1 a 

TOP --MD --> TOP 


TOP / 2 --> TOP 
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8. 8. 12 Type B Mixed Sign Multiplication Example Programs: | / 


: MB* 7 nu-- a ; 


_MD! \ Set up MD 
0 “" \ Set up TOP 

-O + \ Clear CRO 
2*? ne \ Clear CR1 
Ux? Ux?! U*? Ux? U*! u*? U*? UK? \ 


Perform the ; multiplication 
“key yee ue ue utr Ue UM; 


8. 8. 13 $-Bit Unsigned Multiplication Example Program: 


: BMB* ( b bu pa a. | 
- SQt.- . . \ Set up multiplicand 


255 AND a5 - & 5 \ Make 8=bit negative numbers . 
O + a \ Clear CRO 
2*! 2/ © | re \ Clear CR1 


sux’ Bue! BUX! BURT ae _ -. \ Perform the multiplication : 
sux’ Bue? Bue! BUM 5 | a 
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-, 8. 9. Step Division 


Step division ; is perfonned i in 16 steps using long division, and signed division is 
not supported. , | Me eae , 


A 32-bit dividend in TOP and NEXT is ; divided by a 16-bit divisor in MD, | 
= leaving a 16-bit remainder in TOP and a 16-bit quotient in. NEXT. | . 


“Because the quotient iS limited to 16 bits, not all dividends and divisors yield a 
quotient small enough to be represented. In these cases, the result is invalid. The 
ee a ~. . only way to check the validity of a result is to multiply the quotient by the 

, ae € * p OO IN divisor and add the remainder. Also, division by zero yields an invalid result. 


7 Two versions of the division program follow. The santiard version 1 (see. Section — 
8.9.1) tests the value of CR1 for steps 2 through 16. The alternate version — | 
Section 8. . ) does not. | 


It can be. proven that CRI will always be zero for any division that yields : a valid 7 
result, so the standard and alternate versions both work the same as long as the _ 
result is valid. in. the event the result i is invalid, however, their results may differ. | 


There is ‘also an 8-bit version of step division, which is faster than the 16-bit | 
__ version. In the 8-bit version,.a 16 bit unsigned number in TOP is divided by an. 
. 8-bit. unsigned number in MD, leaving an 8-bit result in TOP. The same ~ 
restrictions that ' apply to ‘16-bit eee a apply to 8-bit division. | | 
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8.9.1 Step Division Opcodes 


U/l’ 
Unsigned | Divide step 1 


_ A41A =.1010 0100 0001 1010 | | 
aaar = _ 0100 ==> ALU OP = TOP - MD --> TOP. 
yy = —. 00 = ==> YES ~ = C16 | 
‘sssS = 1010 ==> shift = D2*’ unsigned 
If c16 = 1 - Ss  ( TOP = MD >= 0 ) 


TOP - MD --> TOP | 
- TOP:NEXT * 2 ==> TOP!:NEXT 
YES --> N@ | 


U/? : 
Unsigned Divide Steps 2 through 15 


A45A = 1010 0100 0101. 1010 - 3 

aaar = _ 0100 ==> ALU OP = TOP - MD --> TOP - 
- yy = 01 ==> YES = C16 + CRI 

sssS = 1010 ==> shift = D2*’ unsigned 
ORETCIG 4 CRD al 4 | ( TOP - MD >= 0 ) 


TOP - MD -~> TOP 
~TOP:NEXT * 2 --> TOP:NEXT 
YES --> ND 


- This step differs from U/ 1’ in that the conditional aiberacaot will be 
performed also if the previous subtraction also. produced a result with 1 | 
in the most significant bit (now shifted into a even oe the 

| subtraction was successfully performed. 


It can be proven that this only happens when the dividend i is too ae to 
produce a 16-bit quotient when divided by the current divisor. | 
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Use 
Unsigned Divide Step 16 


A458 = 1010 0100 0101 1010 a .. 3 
. aaar = 0100 ==> ALU OP = TOP =— MD --> TOP 
yy = tain IOV ° _ ==> YES = C16 + CR1.. 
-sssS = 1010 ==> shift = N2*’ unsigned 
If.clé+cCRlL=1 (TOP = MD >= 0) > 


TOP - MD --> TOP 
NEXT * 2 --> NEXT 
YES --> NO 


This step differs fom U/’ in that ‘alle NEXT iS shifted. This allows the 
correct 1 remainder to be at in TOP. 


a 
Alternate Unsigned Divide Sten: ‘16 


' A418 = 1010. 0100 0101 1010: os “s 
aaar = 0100. ==> ALU OP = TOP - MD --> TOP 
yy . = 00 ==> YES = C16 . | 
sssS = 1010 ==> shift = N2*° unsigned 
If C16 = 1 _ ( TOP - MD >= 0 ) 


TOP - MD --> TOP 
NEXT * 2 -=> NEXT 
. YES. --> N@ 3 


This step differs from U/L j in 1 that omy: NEXT i is shifted. ‘This allows a 
| correct remainder to be left in TOP. | 


BU/! : ; oe 
-8-Bit vheiyded® Divide Steps 1 through 8 


A412 = 1010 0100 0001 0010 
aaar = 0100 | ==> ALU. OP = TOP - MD --> TOP 
“yy = » 2°00 ™ + ==> YES = C16 a 
sssS = 0010 ==> shift = Q*! unsigned ne 
Te Cibo | YS (TOP - MD >= 0 ) 


TOP - MD --> TOP 
TOP * 2.--> TOP 


- YES --> TQ. 
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8 9.2 ‘Standard Division Program Example 


This version of the division em tests the value of CR1 for steps 2 through 


16. 


: UM/MOD ( ud u -— ur ug ) 


MD! 

D2* 

oe 7 
Us! U/! U/? U/! Us! 


Us? Us! US) U/? Us 


U/’ U/? Us! U/? 
u/" : 


-SWAP ; 


AA A 


Set up divisor 


Clear CRO, 17 ahvede: ie not needed 
Step 1 | 
Steps 2 - 15. 


Step 16 
Put quotient, remainder in fig places 


8.9. 3 Alternate Division Program Example 


“This version of the division Program does not test the value of CR1 for steps 2 : 
— 16. | 7 


 UM/MOD ( ud u -~ ur uq ) 
_ MD! © . Set up divisor 
Clear CRO, 17 divide gebea not needed: 


Steps 1-- 15 


D2* . 
U/1" U/1! U/1S. U/1" 

Us)? U7" U/1E Us = 
"Us/AS U/L 0/7 

U/1’ U/1" U/1! 
U/i8 eo SO he \ Step 16. ; 
SWAP ; os “= \ Put quotient, ‘remainder in Heke places . 


8. 9. 4 8 Bit Division Program Example 


This version of the division program also does not test tiie valle of CRI ‘be steps 


1 through 8. 
| : 8U/ f ud u -- ur uq ? ep oe IS 
SQi- , . i .\ Set up divisor 


2* | | |  \ Clear CRO 
8U/’ 8U/’ 8U/" 8U/" | \ Divide it . 
8U/’ 8U/" 8U/"’ 8U/! | 7 . 


FF AND ; \ Discard remainder Weg 
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a 8.10 Step Square Root. 


. The RTX 2000 Series of Microcontrollers implement an exact algorithm (vs. an 


: approximate algorithm such as Newton’s oo for Reding the square root of 
an integer. | | | | : 


In the event the iignat value is not a perfect square, a root and a cemaitiicr aie E 


~ found. The remainder is similar to the remainder of division: if the remainder is 


added to the square of the root, the original input value is found. It is possible | 
for the remainder to exceed the 16 bits of the TOP iceisiets ae the 17th bit | 
will always be contained i in CRO. — . & 


- Prior to executing the Square root steps, the 32-bit input vane 1S placed into TOP — 
and NEXT, MD is cleared, and a value of 8000 hex is placed i in SR... 


7 Square root steps are similar to division steps but cabiciet the SQ register instead 

of MD from TOP. They also use CR1 to determine whether to perform the — 

conditional subtraction. This is needed for Square root ‘Steps because the value 
being subtracted changes from Step to step. | | 


~ Step Square Root Algorithm 


That is, it consists of repeatedly saberacing a subtrahend from the input value. | 
If the result of the subtraction is negative, however, the value prior to the 
~~ subtraction is restored. This can also be thought of as a conditional subtraction. 
. The difference from long division lies mostly in the value of the subtrahend, | 
which changes from step to step depending on which of the previous subtractions a 
were successful.’ A subtraction is successful if it leaves a non-negative 


- intermediate result. If a subtraction is successful, a one is shifted into the least 


significant bit of the result; otherwise a zero is shifted in. To get a feeling for _ 
how the subtrahends are generated, let us consider the 8 bit square root of a 16 


. bit number using hardware similar to that in the RTX. The bits of the root are __ 
represented by 1, with r, being the most significant bit. The trial subtrahends are . 


shown aligned with the square from which they are to be subtracted. 
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ee 


0 0 0 
0 0. 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
r 0 0 
1 0 1 


_As is evident from the table, each trial subtrahend is dependent only on the results 
of the trial subtractions executed prior to the current trial. The values in this table _ 
are also the same as the first 8 subtrahends for the 16-bit Square root, and it 
should be apparent how the remaining 8 subtrahends are generated. These 
subtrahends are generated directly by MD, SR and the surrounding circuitry. | 


‘Unless you are a mathematician, it is probably not obvious why subtracting these 


values should: give you the square root of a number. The remainder of this 2, 


; section may help you to understand how this algorithm works. 


Consider two numbers, r (an N-bit unsigned integer) and S (an unsigned integer 
with 2N bits). Let s be the square of r. One can represent ras a Epolyiemal of 
powers of 2: A | 


r= Ln an read 4 teoet 242 + or 

where n=N-1 and Ip through r r, are ‘eitfier zero Or one. "Since S is ; the square of r, 
S can also be oot as a polynomial of powers of 2: i | 

rr i 


 bege + ror. gant ee ae ae 
[faatn g2n-i + X. 222 24 owt Yosree | 4s 
e ze +. i ‘ ; 


“[rorn2” + Yoln-12”* oat rg). 
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Notice that since r, can only equal 1 or 0, r,r, = I, It is useful to visualize the 
terms of s as values ina square array, as shown i in Table 8.8: _ 


| TABLE 8.8: 8: TERMS. OF OF S AS V AS VALUES INA IN A SQUARE ARRAY 


i 
re 2" r nto2" 


Lqeat a2” Ppt e2™ fic 


r,2? 


T.tee. 


Notice that all the perfect squares are on the diagonal running from the upper left 
comer to the lower right comer, and the rest of the array is symmetrical about the - 
diagonal, and each term in the lower left half of the array has an identical term 
in the upper right half. Like terms can be combined by adding the terms in the . 
upper right half to those in the lower left half. Each row in the new triangular 
matrix may then be used to form a trial subtrahend, lo through t» that may be 
used for finding the square root of s: 


y= 2" 

2tn-1) 
tia = yr27 + : oe 2 a . 
Cn-2 = r Ent n-22" oe Eealaee" ra + Lace 


t, "2 eerier + re ary2m ke ee ee a Yee 


ty = raro2tt® + raaro2" + -+ rir 92? + Yo. 


~ Notice that t, depends = On Ty» ty depends only on fs and ane ta depends ae : 
| OB Fata BOO Faz ARE ISOLON:, ” : ae Ea oe 


The technique for using these trial subtrahends for finding the root of sis 
; straightforward. If r, is assumed to be 1, and t, can be subtracted from s, then r, _ 
is indeed 1. If, however, the result of the subtraction is negative, r, is 0, and s 
must be restored to its value prior to the subtraction. Once r, is known, it can be 
used to find r,, by the same method, and so on to 1». This is oa? what | 
eee when submacuns the trial subtrahends in pee 8.7. | 
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. For further illustration we can find the subtrahends for the 8 bit square root in 
? table 1. In this case the root is: 


r= 2? vere + ore 4 ro" ra “r,23 + x22? + r,2 ‘+ ry . 


The square is: . 


i - ae = 
oo = [r,2>* + r4r,s* + eee + Y7Fo2’] + 
Erg ,22? + 162227 + 22. + er,28] +: 


(rer2? f For 2 + wee + To) : 


By combining like terms and grouping into subtrahends \ we obtain: | - 


és r,2¢4 
eg je pero a et 
t, = rorg2** + xer.2** + -x52** 


Fe = Cue + one ese + Estee + ro. ’ 


7 which i iS identical to the description of the subtrahends in Table 8.7. | | 
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An Example: 


For a specific example, let us. find the 8 bit square root of a 16 bit number. Let 
| the mat value equal 8163H erro 1.€. 1000000101100011 ee 


10110101 - 
y1000000101100011 
— =-0100000000000000 lst subtraction 
~ 0100000101100011 Successful 
-0101000000000000 2nd: Subtraction 
1111000101100011 Not Successful | 
0100000101100011 Restore Previous Value 
-0010010000000000 3rd Subtraction 
0001110101100011 Successful 
-’=0001010100000000 4th Subtraction 
0000100001100011 Successful 
-0000101101000000 5th Subtraction 
1111110100100011 Not. Successful £ tip. 8 
_. 0000100001100011 Restore Previous Value 
-=0000010110010000 6th Subtraction: 
0000001011010011 Successful . 
-0000001011010100 7th Subtraction 
1111111111111111 Not Successful 
0000001011010011 Restore Previous Value. 
--70000000101101001 8th Subtraction 


-0000000101101010. Successful, Also Remainder 


| Note that the remainder i is a 9 bit value though the root is an 8 bit value. This 
is. S-possible because the difference between two successive perfect ae iS: 


(ns)? é nt = 6 One i) ont eae 


Therefore the largest sesaible remainder, ‘which i is one less than this difference i is 2 
2n, which requires one more bit to represent than n does. 7 
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8.10.1 


256 


ASIA 


YY 


| Step Square : Root Opcodes 


: gi’ 


Square Root Step 1 


‘ 3’ 


= 1010 0101 0001 1010 
-aaar = = 0101 _~—. ==> ALU op = TOP - sQ =e TOP: 
= 0O - . ==> YES = C16. 
sssS = - 1010 ==> SHIFT = D2*? “UNSIGNED _ 
If clé=1 (TOP - SQ >= 0 ) 
TOP - SQ --> TOP # ae eo beats 
_ TOP:NEXT * 2 --> TOP: NEXT 
YES --> NO> : . 
MD. + SR --> MD. 
SR / 2 -- SR 
| Square Root Steps 2 Through 15 
‘A55A = 1010 0101 0101 1010 | aaa | 
aaar = - 0101. ==> ALU OP = TOP -.SQ --> TOP 
yy = 01 ==> YES. = C16 + CR1 | 
sssS = 1010 ==> SHIFT = _D2*’ UNSIGNED | 
If cl6 + CRI = 1 | ( TOP - SQ >= 0 ) 


TOP -— SQ --> TOP 


-TOP:NEXT * 2 --> TOP:NEXT 
‘: YES --> NO 


MD + SR --> MD 
SR. 2-- SR 


s differs from st’ in that CRI is considered to jet-anie YES. This . 
is necessary because sometimes a successful subtraction will result in © 
the most significant bit being l Once this bit i is shifted into CRI, it 


cannot be subtracted. 


: .s" 
_ Square Root ‘step 16 | 


A558 = 1010 0101 0101 1000 - . oes . 
aaar = 0101. ==> ALU OP = TOP. - so -->.TOP 
“yy = O01 |... ==> YES = C16 +CRl. - 

 .sssS = ; 1000 | ==> ‘SHIFT. = N2*? UNSIGNED 
If C16 + CRI = 1 a 7 TOP - SQ >= 0 


- TOP - SQ -=> TOP» 
NEXT * 2 --> NEXT 


“YES --> NO 
- MD +: SR’--> MD 
~SR./ 2 == SR 
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Ss" differs from S’ in ‘that only NEXT is shifted instead of both NEXT 
and TOP. This i is done so that the remainder i is correct. 


- peir es 
. 8-Bit ‘square 1 ‘Root step 1 
A512 = 1010 0101 0001 0010 oMe e i 
- aaar 0101 ==> ALU OP = TOP - SQ -=>-TOP © 
yy. “Bs <3, 500 | ==> YES. =Cl6 . 
- sssS 0010 ==> SHIFT = 2*? UNSIGNED 


‘Tf C16 = 1 Go © ¢ ete Be % ( TOP = SQ >= 0). 
_ TOP = SQ --> TOP”. . Fay * 
TOP *-2 -—-> TOP . oe 
YES --> TO. 

MD + SR -=-> MD 

SR / 2 -- SR. 


ps’ 
. 8-Bit Square Root. scan 2 Through 8 
A552 = 1010 0101 0101 0010 mo, pan #8 
/ . @aar , 0101 | ==> ALU OP = TOP - SQ --> TOP 
YY, 01: © ==> YES C16 + CRL , 
sssS° ' 0010 ==> SHIFT =.-2*/.. UNSIGNED 


If Cl6+ CRI =2 5 ©. 99° (TOP =.SQ°>= 0 )- 
-  TORP-= SQ --> TOP -* 0 02. re ce aa 
. TOP. * 2.=—> TOP 
YES -->-TO 
MD + SR --> MD _ 
SR / 2 -- SR 
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8.10.2, 


840.3 


Square Root Program Example 


HEX > 
: ROOT ( ‘du - ‘uroct uremainder _ aera 
~ 8000 SR! mo a \ Set up SR 
0000 MD! Set up MD 7 
Dae. \ Get lined up for 1st ‘subtraction 
| | so ar oe Fe “\ lst step. 
s’ s’ s’ s’ s’ s! si. \ Steps 2 - 15. 
/ ge se sh st st se sr ye 
oe s" : ee. \ Last step 
8-Bit Square Root t Program Example 
HEX : 
ae -BROOT ( word -- byte ) ee AS 
8000 SR! aa \ Set up SR_ 
"9000. MD! \.Set up MD . 
2e 50 \ Line up for first subtraction 
BSl! lt. = | \ Step 1 noe 
BS’ BS’ BS’ BS’ BS’ BS’ BS’. \ Steps 2- 8 | a 
FF AND ; os ms _-\ Discard remainder x 25 
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8. 11 Step Bit Reversal 


Bit reversal r reverses the order i in which bits a in a word. For parece 


| Using step math allows bit revenal of a 16-bit word in 23 cycles nae to 48 
cycles without step math. Bit reversal is useful for calculating address during a 
FFT ‘operations: and also is oe to calculate certain types of CRC's S. | 


| 8.111 . Step Bit Reversal Opcodes 


oR 3 
. Bit Reversal Step. 


~ A196 = 1010 0001 1001 0110 | i. ‘ So - 
aaar = ~—*<0001- ==> ALU OP = NO OP, OPERATE. ON SQ 
uu. = © gt AO 2 OSS YES Ce POS see 3 

. sss. = ' 0110 ==> SHIFT = 2/’ UNSIGNED 


Hf TO ae ee 
MD or SR --> MD 
- SR /..2 <-> SR . 
| ‘TOP / 2.-=> TOP 


- 8.11.2 Step Bit Reversal Example Program 

ee REVERSE (u —w ) << 

8000. sR! | 

~ 0000 MD! tt 
R? R’ R’ R! R’.R! RY 
RRO ORT ROR ROR! os ° 


Set up SR 
Set up MD a é uF, 
Put bit reversed version of TOP into MD. 


a 


R’ RS Ba eae — 7 ae 
_ DROP 7. oe Discard garbage in TOP 
“MD@ 7 0 1 ON Retrieve result, | 
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8.12 Step Cyclic Redundancy. Check (CRC) | 


The cyclic redundancy check (CRO) i is used for identification and error ae an 
| of blocks of data, in much the same way a checksum is used. oe ag . 


The advantage of the CRC i is that more errors can be detected with a CRC than - 


by a checksum. For example, a 16-bit CRC can detect all errors ina 16-bitframe 
of a stream of data. There are several de facto standard data transfer protocols, = 
including XMODEM, X.25 and Kermit, that use variations of CRCs for a. - 


| checking. 


The basis for calculating a CRC of : a stream of bits is to hen a modulo-2 ae i 
division of the stream (multiplied by an appropriate power of two) by an 
irreducible modulo-2 nea Ss equouen is discarded, and the Temalndes 
iS the CRC. 7 : - i- | | 


In modulo-2 subtraciion, there is no carrying or enowing | from bit to ) bit, so 


subtraction is the same as a bitwise logical exclusive OR function. The | e. 


polynomial is a value that: cannot be evenly aivdee modulo-2 bd rn | 
. polynomial, much like a prime number. - | 


Most 16-bit CRCs are : éalculated with the polynomial x + x" ee x ate which 
| can 1 also be expressed as 10001000000100001. 


For example, to ayiciiats the XMODEM style CRC of the ASCIL character - cs 
Ae hex), iain the es long division: 7 | - 


1100010000001 000037 OFO10TO00G00OOOOOO0OTGIS ak 
-10001000000100001 | 
-10000000010000100: 
~ 19001000000100001 
~ 10000101001010000 
10001000000100001 


se, ae - 0001101001110001 = 1A71. hex | 


| To calculate thie CRC ofa Stream of characters, ian XOR the CRC’ S of each ; 7 | | 


character with the CRC of the ‘preceding characters. . This can also be 
accomplished by repieeing the 16 os bits of se above dividend Bae the 
previous | CRC. | | 
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One useful aspect of the CRC calculation is that it can be implemented in. 
hardware with 16 shift registers and 3 exclusive OR gates. To see how this is 
, done, think of the above dividend as being i in a L 16- bit shift register. 


When a 1 is shifted out of the register, the contents of the register are exclusive 
OR’ ed: with the 16 least significant bits. of the polynomial. — 


When a zero ‘bit is 5 shifted out, no. exclusive OR’i ing takes place, After er eight ie 
shifts, the register holds the CRC. ae : 


Since many serial data protocols transmit t the least significant bit of the data a first, 7 


many CRC’s are calculated on the bit reversed image of the transmitted character. 


so that the CRC may be calculated in the simple hardware noted above.. In such 
cases, the resulting CRC is also bit reversed and must be un-reversed, though this 
only needs to be done once for each packet of data. pat ge e protocals that | 
_ use one type of CRC are X, 25 and Kermit. | : 


Other variations | on the CRC are to use a non-zero initial value for the CRC, 


usually FFFF hex. The other variation isto exclusive OR the CRC with a non-  _ 


zero value, also usually FFFF hex, before transmitting it, An example of this is. 
the X:25 pee, 


The CRC in RTX step math is hachaiddind such t that bit reversed CRC’s : are ie ie 
generated directly. The result does not need to. be bit reversed in these cases 


because the shifting is done to the right instead of to the left. The polynomial, 
however, must be bit-reversed before exclusive OR’ing it with the data stream. — 
_ This causes no performance loss, however, because it is usually a constant. For. 

‘implementing CRC’s that. do. not use bit-reversal, such -as sD SE: used by 
ree the data and CRC’ $ must be bit reversed. : 


8.12.1. Step ¢ CRC Opcodes i 


<p ae. ; 5 
Cyclic Redundancy Check step . 


| AADE =.1010-1010 1101 1110, | | | ae ae | 
“aaar = 1010, + _ ==> ALU OP = TOP xor MD --> TOP: | 
yy = oF oe oF ==> YES. = TO xor N@ 
= 1110 ==> SHIFT = 


-sssS = 2/* UNSIGNED. 


If°(TO xor NO) =.1 
TOP xor MD --> TOP - 
. TOP:NEXT / 2 --> TOP:NEXT | 
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| 8.12.2 | Step CRC Example Program 


HEX 
0811 CONSTANT POLY _ oN Bit reversed generator: polynomial 
: CRC ( cre byte - -- cre’. | 
SWAP . \ Rearrange data and original CRC 
_ POLY MD! | _-\ Set’ up MD . 
CF Ch Cr CP oCh Oh er Cr \ Calculate new CRC . . 
NIP a | - \ Discard parcial quotient | 


To use this program to calculate a CRC for Kermit or CRC-CCITT, use an initial 
CRC value of zero. To use this program to calculate a CRC for X.25 protocol, — 
use an initial CRC value of FFFF hex and invert the i CRC before - 
transmitting it with the packet. - 


7 The CRC for two characters can be édleuisied at.once 2 by using 16 Cc steps | 
_ instead of eight, while placing the first character Teceived i in the ea significant - 
| eye ot the data word. 


8.12.3 XMODEM CRC Example Program 
HEX aes 
a CRCX: 7d crc byte -- crc’ ) ; 
~ BIT-REVERSE \ Reverse data byte 


~ SQ! MD@ _ ~\ Left justify data byte > 
_ SWAP BIT-REVERSE SWAP \ Reverse original CRC 
TER - 7. \ Calculate new CRC 
BIT-REVERSE. ; a \ Un-reverse new CRC 


Note that in eieaiaiiis the CRC for a long string of eharacters the CRC * only | 

—. needs to be reversed and un-reversed at the beginning and end of the string of 
- characters. | | | 

If the initial CRC is zero, as it is in XMODEM, it only needs a be un-reversed 

at the end of the string of characters. Every character, oN needs to be — 

reversed. , ; 
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8.13. Step Math Reference 


- $igned/Unsigned | — 
Micro Opcode __. 
Field we 


Unconditional shift a 
gtinero pda eh ate : 


_ ‘Gondittenar bahavicr® . 
. Micro Opcode. Field _ 


— Register Selection | 
Micro Opcode Field 


— ALU. Micro opcode’ Field 


| “FIGURE 8. 2: STEP MATH INSTRUCTION FORMAT 


TABLE 8.8: ‘STEP MATH ALU FUNCTIONS. 


Function’ 


‘| No ‘ALU. suacakien: a ae | 
|. YES => TOP and REG.--> TOP . 
YES => TOP - REG ~-> TOP 
~ YES => TOP or. REG --> TOP 
|- YES TOP + REG --> TOP 
‘|. YES TOP xor REG -=> TOP |] | 
| YES => REG - TOP --> TOP- jf 
‘| YES = 1 =>... .' REG me () 2 


iil 

RPP RPP PRP 
: 
Vv 


il 
Vv 


TOR’ tee oi A (ene). - 
TOP (alu. op) MD (shift) — 
“Me SRO 55. Gs 3%, (shift) 

TOP (alu op) SQ: (shift) 
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__ TABLE 8.10: BEHAVIOR OF YES. 


| IF ARITHMETIC THEN co16 ELSE TRO. 


. (IF ARITHMETIC ‘THEN CO16 ELSE aRO) or CRI 
| IF IR3.= 0 THEN TO ELSE. NO” 
i ee xOr NO . ~ 


_TABLE 8.11: srep MATH SHIFT OPERATIONS __ 


: SOME. USEFUL STEP MATH orconEs 


| DESCRIPTION caad 


17: Bit left shift . 
‘| Rotate TOP: NEXT right 
| Rotate’ TOP. right hee 
Signed multiply steps ‘1-15"° 
Signed multiply step 16. 
_Unsigned multiply steps 1- 16 . 
| Mixed sign multiply step 16 
- Byte unsigned multiply steps 1-8 
Byte mixed sign multiply ae 8 
Unsigned divide step 1 to 
| Unsigned divide steps 2- -15. 
| Unsigned divide step 16 i 7 | or 
Alternate. unsigned divide step 16 | 
| Byte unsigned: divide steps 1- “8 | 
| Square root step 1. — 
| Square root steps 2- is” 
‘1 Square: root step 16. 
|. Byte Square root. step 1 
Byte. square. root. nares 2- 8. 
_ | Bit reversal» step - 
ii CRC step 


“4 aos 
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See 


USING THE RTX2000 STACK CONTROLLER 


Two of the powerful features of the RTX2000 are its two on-chip etacka: 
the parameter stack and the return stack. Because thease stacks can oper~ 
ate simultaneously with the memory data bus and and ASIC data bus, thay 
increase the performance ef the RTX2000 by maximizing the quantity of data 
that can move in one précassor oycle (refer to the RTX2000 block 

diagram). In addition to increasing the apeed of the RTX2060, the stacks 
can interrupt the/ processor in tha event of a stack overflow or underflow. 


PARAMETER STACK STRUCTURE 


The parameter stack is a 258 word by 16 bit stack that is used for storing 
data and addresses for arithmetic and logical operations. The top of the 
parametar stack is the TOP register in the RTX processor core, the next 
stack location is the NEXT regigter in the RTX core, and the remaining 256 
legations are in on-chip RAM controlled by an RTX stack controller. 


RETURN STACK STRUCTURE 


The return stack is 4 256 ward by 21 bit stack that is ugead for storing 
return addresses for subroutine calla and loop counters for certain opera- 
tions. The top of the return stack is in the processor ¢ore index regis- 
Cer (I) and the ramaining lecations are in on-chip RAM centrolled by an 
RTX stack controller. You may hava noticed that the raturn stack has 21 
bits per word. This is partly because tha RTX2000 supports a twenty bit 
address. Howaver, since op codes ara on word boundaries, only nineteen 
bits are required to dafine a return address for a subroutine call. These 
are bite one through nineteen, where bit gero is the least significant bit 
and bit twanty is the most significant bit. Bit zero is used to detarmine 
Whether the address ig for a return from an interrupt (1) or 4 return from 
a gubroutine call (0). This allows interrupte to be enabled whan return- 
ing from an interrupt, Bit 20 is used to store the data page register 
select bit (DPRSEL) which is used to determine which memory paga is used 
for data feteh and store operations. Whan the return stack is used fer 
storing loop variables ( >R and R» }, only bits zare through fifteen are 
uged for data. Bits sixteen through twenty are loaded with tha content of 
the ae page redister (CPR) during an interrupt, subroutine call or =R 
execution, | 7 


RTX INTERRUPT CONTROLLERS 


The RTX2000 interrupt controller circuitry consists of two identical sec- 
tions, one for the parameter stack and one for the return stack. Each 
section has two user accessible @-bit registers that control the operation 
of ite stack: a stack pointar register (SPR) and a stack limit register 
(SLR). Since the data path on the RTX2000 is sixteen bits wide, the SPR 
for thé parameter stack and the SPR for the return stack are concatenated 
te form 4 single sixteen-bit SPR for the purpose of accessing them by the 
processor. Similarly, the SLR for the parameter stack is concatenated 
with the SLR for the return stack, The SPR may ba read as Well as written 
to, but the SLR is a write-only register. In beth cases, data bits 0 
through 7 apply to the parameter stack and data bits 8 through 15 apply to 
tha raturn datack, | : 


ay 
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STACK OPERATION 


Both stacks have 256 words, numbered from zere to 255. When data is 
pushed onto the stack, it is written to the location above that pointed to 
by the SPR, atid the SPR igs incremented (thia is equivalent to a 
pre-incrament). When data is pepped off the stack, it is read from the 
lecation pointed to by the SPR, and then the SPR is decremented 
(post-decremant). When tha RTX2000 is reset, the SPRs are reset to sero, 
so the first location written to will be location ona. If an attempt is 
made to pugh data onte a stack beyond word 255, the stack will wrap around 
and start pushing at lecation zero. Similarly, if an attempt is mada to 
pop data off the stack below location zero, the stack will wrap around tc- 
lecation 253. For this reason the parameter stack can be used as a 
recireuluting 256 word buffer. Also, because the stacks are circular, 
they must be managed carefully to prevant overflows and underflows from 
producing incorrect results. The stack controller helps to accomplish 
this by generating overflow and underflow interrupts. | 


STACK ERROR INTERRUPTS 


The RTX stack controller requests an overflow interrupt anytime the value 
of the SPR is greater than the SLR. Since the SPR can naver exceed 255, 
the stack controller cannot generate an overflow interrupt with tha SLR at 
ite raset value of 255. Therefore the user should set the SLR to a value. 
legs than 255, which alse provides an overflow buffer that allows for the 
axtra oycles required to acknowledge the interrupt. Another point about 
using overflow interrupts is that a push-pop sequence can generate an 
interrupt request that goes away before it is serviced. Tha resuit of 
this i# that tha interrupt controller causes the processor to execute code 
pointed to by tha "no interrupt" veetor. This is referred to as a phantom 
interrupt. Therefore, this location should always be initialized to a 


valid code sequence, even if it is only a "no-op, return" sequence. 


The RTX stack controller handles underflows aimilarly to overflows. The 
controller requests an underflow interrupt when data is popped off loca- 
tion one, Algo, if data is pushed onto the stack until it wraps around, 
an underflow interrupt will be generated when data is pushed onto location 
zero, Because popping data eff location one causes an interrupt and loca- 
tion one ig normally the first stack location used, the interrupt generat- 
ad when data ia popped off location one would occur evan though the data 
ie valid. Therefore, whenever underflow interrupts are used, the first 
lecation used should be location two. This is most easily done by pushing 
dummy data to location one at start up. The "DUP" instruction will aeccom- 
plish this for the parameter stack, and the sequence "DUP >R" will accom- 
Plish this for the return stack. Because pushing data onto location zero 
Will request an underflow interrupt, the validity of the stack can be 
managed with jugt the underflow interrupt, while tha overflow interrupt is 
ugeful for managing a virtual stack greater than 256 words or multitask-~ 
ing, The underflow interrupt only methed also allows the greatest number 
ef words to be used (284 out of 256) while using stack error interrupts. 
Tf etack error interrupts are not used, the entire stack is usable. 
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The underflow interrupt also can exist for a short anough time to produce 
phantom interrupts. This can happen if a pop from lecation one is immedi- 
ately followed by a push to location one. For this reason, it is neces- 
sary to provide valid code at the memory location pointed to by the 
nowinterrupt vector, | 


USING THE STACK POINTER REGISTER 


It is podsible toe perferm stack manipulations by weading and writing the — 
SPR. Because of the intricate interaction between the SPR and the stacks, 
great care must ba exercised to prevent the processor from getting lost. 


READING THE STACK POINTER REGISTER 


The stack pointers are both read in a single cycle by executing a SPRE 
instruction. The parameter stack pointer is retrieved in the least 
significant byte and the return stack is retrieved in the most significant 
byte. The value obtained is not exactly the value of tha stack pointers 
‘prier to the SPR@ cycle, however. The outputs of the stack pointers 
normally pases through incrementers on their way to being read. In 
addition to this, pushes and pops on aither stack during the SPR@ cycle 
cause the stack pointers to change in the middle of the cycle. This can 
quuse additional inerements and decrements to the values obtained and. 
ghould, therefore, be avoided. For example, returning from a subroutine 
during 4 SPR@ cycle causes the value for the return stack to he 
decremented instead of incremented. | 


Reading the SPR usually resuita in a push on to the parameter stack. This 
caugsas the parameter stack pointer to be incremented in the middle of the 
QSPR cycle. The output of the incremented stack pointer then passes 
through the incrementer bafore it is ready The regyult is that the value 
obtainad is the value of the parameter stack prier to executing thea SPRG 
plus two. Because of the number of steps that must be performed on the 
value from the stack pointer in the last half of the SPR@ cycle, it is 
recommended that the stack pointers be read without performing any stack 
Operations during the fetch cycle. This can be done as follows: 


1. Do not combine 4 SPR@ with a raturn from subroutine. 


2. Combine the SPR@ with an arithmetic or logical function to prevent the 
rasult from pushing the previous top of stack down on the stack. An 
effective way to obtain the values of the stack pointers follows: 


HEX \ BASE 16 FOR THIS DISCUSSION 
=102 \ CORRECTION FACTOR : 
\ RETURN STACK: 1 
\ PARAMETER STACK: 2 
SPR@ + \ SINGLE OP CODE: NO STACK OPERATIONS 
NOP ; \ IF A RETURN FOLLOWS, MAKE IT A SEPARATE OP CODE 


Oy 
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WRITING TO THE STACK POINTER REGISTERS 


Since stack operationa de not change the value stered in the SPR, it may 
be written to by a simple SPR! instruction. There is one thing to watch 
out for, however. If a SPR! is combined with a return from subroutine, 
the value popped off the return stack into the I register will be 
determined by the# value in tha SPR prior te the execution of the SPR! 
jnatruction. 


